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invention are a nucleic acid construct comprising a nucleic acid which encodes a UCP3 protein and a host cell; a host cell comprising 
the nucleic acid construct which encodes UCP3; and a method for producing mammalian UCP3 comprising introducing into a host cell 
the nucleic acid construct which encodes UCP3 whereby the nucleic acid is expressed. The present invention also relates to isolated or 
recombinantly produced UCP3 protein and functional portions thereof. Also encompassed by the invention are a method of identifying 
an inhibitor (e.g., antibody) or enhancer of UCP3 expression and/or function, and the use of UCP3 inhibitors and enhancers. The present 
invention also relates to a method of detecting UCP3 in a sample obtained from a individual. 



> 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


A2 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 




"Republic of MohSova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


v TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


15 


Iceland 


MW 


Malawi 


us 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 




Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


d 


Cdte d'lvohc 


KP 


Democratic People's 


NZ 


"New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


"Poland 






CN 


China 


KR 


Republic of Korea 


FT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


~W 


Russian Federation 








Germany 


U 


Liechtenstein 




Sedan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


"Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







'4 



WO 98/45438 



PCT/US98/06959 



-1- 

UCP3 : AN UNCOUPLING PROTEIN HOMOLOGUE 

RELATED APPLICATIONS 

This application is a Continuation- in-Part of 
Application No. 08/892,745 entitled "UCP3 : An Uncoupling 
5 Protein Homologue Expressed Selectively and Abundantly in 
Skeletal Muscle and Brown Adipose Tissue" filed July 15, 
1997 and claims benefit of U.S. provisional application 
number 60/046,254 entitled "Discovery of an Alternative 
Form of UCP3, Designated UCP3-Short Form (UCPsh) » filed May 

10 12, 1997 and U.S. provisional application number 

60/043,447, entitled "An Uncoupling Protein Homologue 
Expressed Selectively and Abundantly in Skeletal Muscle and 
Brown Adipose Tissue", filed April 9, 1997. The teachings 
of Application No. 08/892,745, U.S. provisional application 

15 number 60/043,447 and U.S. provisional application number 
60/046,254 are incorporated herein by reference in their 
entirety. 

GOVERNMENT FUNDING 

This work was supported in part by the National 
20 Institutes of Health Grants DK02119 and DK49569. 

Therefore, the U.S. Government has certain rights in the 
invention. 

BACKGROUND 

Calories are expended by mitochondria in a highly 
25 regulated fashion. Oxidation of fuels by the electron 

transport chain generates a proton electrochemical gradient 
across the inner mitochondrial membrane. Re-entry of 
protons via ATP synthesis drives conversion of ADP to ATP. 
Uncoupling proteins (UCPs) axe inner mitochondrial membrane 
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transporters which dissipate the proton gradient, releasing 
stored energy as heat (Nicholls, D.G., et al., Physiol. 
Rev., 64:1-64 (1984); Klingenberg, M . , et al., TrnedS 
Biochem. Sci . , 15:108-112 (1990)). For this reason, UCPs 
5 are potentially important determinants of metabolic 
efficiency. UCP1, the first uncoupling protein to be 
identified (Lin, C.S., et, al., FEBS Lett. , 113:299-303 
(1980); Jacobsson, A., et al . , J. Biol. Chem. , 260:16250- 
16254 (1985); Bouillaud, F. , et al., J. Biol. Chem. , 

10 261 : 1487-1490 (1986)), is expressed exclusively in brown 

adipose tissue, an important site of energy expenditure in 
rodents (Himms-Hagen, J . , Prog. Lipid Res., 25:67-115 
(1989) ) . However, UCP1 may be of lesser importance in 
humans, in whom the amount of brown adipose tissue is 

15 limited. A second uncoupling protein, referred to UCP2, 
was recently identified (Fleury, C, et al., Nature 
Genetics, 15:269-272 (1997)) or UCPH (Gimeno, R.E., et al . , 
Diabetes, 46:900-906 (1997)). In contrast with UCP1, UCP2 
is expressed in many tissues, including sites not thought 

2 0 to mediate energy expenditure which occurs in response to 

environmental temperature or diet (adaptive thermogenesis) . 

A greater understanding of the genes involved in 
metabolism will provide new approaches and targets for 
regulating energy expenditure in mammals. 

25 SUMMARY OF THE INVENTION 

The present invention relates to an uncoupling protein 
(UCP3) gene which is selectively expressed in skeletal 
muscle and brown fat, two tissues involved in energy 
expenditure in mammals. In addition, the invention relates 

3 0 to an alternative form of UCP3 designated UCP3-si^ort form 

(UCP3sh) , which is also expressed in skeletal muscle. 
Skeletal muscle particularly has a capacity for energy 
-expenditure^, or adaptive thermogenesis, in iaumans^ 
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As used herein, "UCV3" refers to UCP3 arid UCP3sh. In 
particular, the present invention relates to isolated 
(e.g., purified, essentially pure) nucleic acids 
(oligonucleotides, nucleotide sequences) which encode a 
5 mammalian (e.g., human) UCP3 protein, and include for 
example, nucleic acids (DNA, RNA) obtained from natural 
sources, recombinantly produced or chemically synthesized. 
The nucleic acids of the present invention include nucleic 
acids encoding human UCP3 (SEQ ID NO: 1), human UCP3sh (SEQ 

10 ID NO: 2), mouse UCP3 (SEQ ID NO: 7) and characteristic 
portions thereof (e.g., probes, primers). The invention 
also includes complementary sequences (i.e., a complement) 
of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO : 7 and *r 
characteristic portions thereof. The nucleic acids of the 

15 present invention encompass nucleic acids encoding a human 
UCP3 amino acid sequence (SEQ ID NO: 3), a human UCP3sh 
form amino acid sequence (SEQ ID NO: 4) , a mouse UCP3 amino 
acid sequence (SEQ ID NO: 8) and characteristic portions 
thereof . 

2 0 The present invention further relates to isolated, 

recombinantly produced or synthetic nucleic acids which 
hybridize to the nucleic acids described herein (e.g., SEQ 
ID NO: 1, SEQ ID NO: 2, SEQ ID NO : 7 or characteristic 
portions thereof) and encode UCP3 protein (a protein having 
25 the same amino acid sequence as the amino acid sequences 
included herein and/or a protein which exhibits the same 
characteristics as the UCP3 protein described herein) . In 
particular, the invention relates to nucleic acids which 
hybridize, under moderate or high stringency, conditions, to 

3 0 SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 7, characteristic 

portions thereof or other sequences which encode, UCP3 . 

Also encompassed by the present invention is a nucleic 
acid construct comprising nucleic acid which encodes a UCP3 
protein (e.g., SEQ ID NO; 1, SEQ ID NO: 2, SEQ ID NO: 7 and 
35 characteristic portions thereof) , wherein the nucleic acid 
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of the construct is expressed when the construct is present 
in an appropriate host cell. In one embodiment, the 
nucleic acid construct of the present invention is operably 
linked to exogenous regulatory sequence (s) such as a 
5 promoter and/or enhancer, whereby mammalian UCP3 is 

expressed when the host cell is maintained under conditions 
suitable for expression. The present invention also 
relates to a host cell comprising nucleic acid encoding 
mammalian UCP3 protein. 

10 Also encompassed by the present invention is a method 

for producing a mammalian UCP3 protein (human) . In the 
method, a nucleic acid construct comprising a nucleotide 
sequence (DNA, RNA) which encodes a mammalian UCP3 protein 
is introduced into a host cell, resulting in production of 

15 a recombinant host cell which contains a UCP3 coding 
sequence operably linked to an (i.e., at least one) 
expression control sequence. The host cells produced are 
maintained in a suitable medium under conditions 
appropriate for the nucleotide sequence to be expressed, 

2 0 whereby the encoded UCP3 is produced. 

The present invention also relates to isolated (e.g., 
purified, essentially pure) UCP3 protein and includes, for 
example, UCP3 protein obtained from natural sources, 
recombinantly produced or chemically synthesized. For 
25 example, the UCP3 protein can be human UCP3 protein (SEQ ID 
NO: 3), human UCP3sh (SEQ ID N0:4), mouse UCP3 protein (SEQ 
ID NO: 8) or functional portions thereof. 

The present invention also pertains to a method of 
identifying agents which modulate or alter (e.g., inhibit 

3 0 or enhance) UCP3 activity. An inhibitor of UCP3 interferes 

(partially or completely) with the function or bioactivity 
of UCP3 , directly or indirectly. An enhancer (activator) 
of UCP3 increases or enhances the function or bioactivity 
jo£ UCP3, directly or indirectly- 
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In one embodiment, the present invention relates to a 
method of identifying an agent which alters UCP3 activity, 
wherein a nucleic acid construct comprising nucleic acid 
which encodes a mammalian UCP3 is introduced into a host 

5 cell(s) . The host cells produced are maintained under 
conditions appropriate for expression of the encoded 
mammalian UCP3 , whereby the nucleic acid is expressed. The 
host cells are then contacted with a compound to be 
assessed (an agent) and the mitochondrial electrical 
10 potential of the cells is detected in the presence of the 
compound to be assessed. Detection of a change in 
mitochondrial electrical potential in the presence of the 
agent indicates that the agent alters UCP3 activity. In a 
particular embodiment, the invention relates to a method of 

15 identifying an agent which is an activator of UCP3 activity 
wherein a nucleic acid construct comprising nucleic acid 
which encodes a mammalian UCP3 is introduced into a host 
cell (s) . The host cells produced are maintained under 
conditions appropriate for expression of the encoded 

20 mammalian UCP3 , whereby the nucleic acid is expressed. The 
host cells are then contacted with a compound to be 
assessed (an agent) and the mitochondrial electrical 
potential of the cells is detected in the presence of the 
compound to be assessed. Detection of a decrease or 

25 reduction of mitochondrial electrical potential in the 

presence of the agent indicates that the agent activates 
UCP3 activity. In another embodiment, the invention 
relates to a method of identifying an agent which is an 
inhibitor of UCP3 activity, wherein a nucleic acid 

30 construct comprising nucleic acid which encodes a mammalian 
UCP3 is introduced into a host cell (s) . The host cells 
produced are maintained under conditions appropriate for 
expression of the encoded mammalian UCP3 , whereby the 
nucleic acid is expressed. The host cells ajre then 

35 contacted with a compound to be assessed (an agent) and the 
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. mitochondrial electrical potential of the cells is detected 

in the presence of the compound to be assessed. Detection 
' of an increase of mitochondrial electrical potential ,in the 
presence of the agent indicates that the agent inhibits 
5 UCP3 activity. Methods of identifying agents which alter 
UCP3 activity can also be performed, as described herein, 
using a mixture of a membrane fraction, mitochondria and 
UCP3 (Jezek, etal., J. Biol. Chem. 271:6199-6205 (1996)). 
Also encompassed by the present invention is an agent 
10 which interacts with UCP3 directly or indirectly, and 

inhibits or enhances UCP3 function. In one embodiment, the 
agent is an inhibitor which interferes with UCP3 directly 
(e.g., by binding UCP3) or indirectly (e.g., by blocking 
the ability of UCP3 to regulate thermogenesis in skeletal 
15 muscle and/or brown adipose tissue) . In a particular 

embodiment, an inhibitor of the UCP3 protein is an antibody 
specific for UCP3 protein or a portion of a UCP3 protein; 
that is, the antibody binds the UCP3 protein. For example, 
the antibody can be specific for the human UCP3 protein 
20 (SEQ ID NO: 3, SEQ ID NO: 4), the mouse UCP3 protein (SEQ 
ID NO: 8) or functional portions thereof. Alternatively, 
the inhibitor can be an agent other than an antibody (e.g., 
small organic molecule, protein, peptide) which binds UCP3 
and blocks its activity. Furthermore, the inhibitor can be 
25 an agent which mimics UCP3 structurally but lacks its 

function. Alternatively, the inhibitor of UCP3 can be an 
agent which binds to or interacts with a molecule which 
UCP3 normally binds with or interacts with, thus blocking 
UCP3 from doing so and preventing it from exerting the 
3 0 effects it would normally exert. In another embodiment, 
the agent is an enhancer of UCP3 which increases the 
activity of UCP3 (increases thermogenesis in skeletal 
muscle and/or brown adipose tissue) , increases the length 
of time it is effective (by preventing its degradation or 
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otherwise prolonging the time during which it, is active) or 
both, either directly or indirectly. 

The -present invention also relates to antibodies 
(monoclonal or polyclonal) or functional portions thereof 
5 (e.g., an antigen binding portion such as an Fv, Fab, Fab', 
or F(ab') 2 fragment) which bind mammalian UCP3 . 

Isolation of UCP3 makes it possible to detect UCP3 in 
a sample (e.g., test sample). The present invention also 
relates to a method of detecting mammalian UCP3 in a sample 

10 (e.g., skeletal muscle, brown adipose tissue) obtained 
from an individual, such as a human. In one embodiment, 
the sample is treated to render nucleic acids in the sample 
available for hybridization to a nucleic acid probe (e.g., 
SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 7 and/or" 

15 characteristic portions thereof which bind to 

characteristic regions of UCP3 -encoding nucleic acids). 
The treated sample is combined with a nucleic acid probe 
(labeled or unlabeled) comprising or complementary to all 
or a characteristic portion of the nucleotide sequence 

20 encoding UCP3 protein, under conditions appropriate for 
hybridization of complementary nucleic acids to occur. 
Hybridization of nucleic acids in the treated sample with 
the nucleic acid probe is detected; the occurrence of 
hybridization indicates the presence of UCP3 protein in the 

25 sample. In another embodiment, the sample is contacted 

with an antibody which binds to UCP3 protein (e.g., SEQ ID 
NO: 3, SEQ ID NO : 4, SEQ ID NO : 8 or functional portions 
thereof) under conditions suitable for binding of the 
antibody to the mammalian UCP3 . Binding of the antibody to 

.3 0 a component of .the sample is detected; binding of the 
antibody to a component of the sample indicates the 
presence of UCP3 protein in the sample. 

Isolation of UCP3 also makes it possible to identify a 
promoter(s) and/or enhancer (s) of the UCP3 gene. 
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Identif ication of promoters and/or enhancers of the UCP3 
gene allow for identification of regulators of UCP3 

transcription. 

In addition, the present invention relates to 
5 transgenic non human animals (e.g., mice) which lack the 
UCP3 gene or contain a nonfunctional UCP3 gene such that 
UCP3 activity is lacking (e.g., UCP3 knockout mouse). The 
invention also relates to methods of producing UCP3 gene 
knockout animals, such as mice. UCP3 knockout mice can be 
10 used to further study the UCP3 gene and to assay for 
inhibitors and enhancers of UCP3 . 

The present invention also relates to a method of 
inhibiting (partially, completely) protein catabcilism in a 
mammal (e.g., human) comprising administering to- the mammal 
15 an effective amount of an inhibitor of UCP3 . The invention 
also relates to a method of enhancing protein catabolism in 
a mammal comprising administering to the mammal an 
effective amount of an enhancer of UCP3 . Also encompassed 
by the present invention is a method of inhibiting muscle 
20 wasting in a mammal comprising administering an effective 
amount of an inhibitor of UCP3 to the mammal. 

.Discovery of the UCP3 gene provides for selective 
modulation (enhancement, inhibition) of the expression 
and/or function of the UCP3 gene in skeletal muscle and 
25 brown fat, two tissues involved in adaptive thermogenesis . 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A-1C are the nucleotide sequence of human 
UCP3 (SEQ ID NO: 1) and three different amino acid 
sequences (SEQ ID NO: 27, SEQ ID NO: 28 and SEQ ID NO: 29) 
30 translated from SEQ ID NO: 1. 

Figures 2A-2B are the nucleotide sequence of the UCP3- 
short form (UCP3sh) gene (SEQ ID NO: 2) and three different 
amino acid sequences (SEQ ID NO: 30, SEQ TD W>, 31 and SEQ 
ID NO: 32) translated from SEQ ID NO: 2. 
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Figure 3 is a comparison of the human UCP3 amino acid 
sequence (SEQ ID NO: 3), the human UCP3sh amino acid 
sequence (SEQ ID NO; 4) , the human UCP1 amino acid sequence 
(SEQ ID NO: 5) and the human UCP2 amino acid sequence (SEQ 
5 ID NO: 6) ; sequence alignments were performed using the 

ALIGN program (Myers, E.W., and Miller, W. , Computer Appl ^ 
Biosci. 4:11-17 (1988); and the Genbank accession numbers 
for hUCPl, hUCP2 and hUCP3 are U28480, U94592 and AF001787, 
respectively . 

10 Figure 4 is a graph of the hydrophilicity plots of 

human UCP2 and human UCP3 showing the hydrophobicity of 
protein across linear sequence; hydrophilicity plots for 
hUCP2 and hUCP3 were generated using the methods "of Kyte 
and Doolittle (Kyte, J. and Doolittle, R.F., J: Mol . Biol . * 

15 157:105-132 (1982)). 

Figures 5A-5C are the nucleotide sequence of mouse 
UCP3 (SEQ ID NO: 7) and three different amino acid 
sequences (SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35) 
translated from SEQ ID NO: 7. 

20 Figure 6 is the amino acid sequence of mouse UCP3 (SEQ 

ID NO: 8) . 

.Figure 7 is a comparison of the mouse UCP3 amino acid 
sequence (SEQ ID NO: 8) with the mouse UCP1 amino acid 
sequence (SEQ ID NO: 9), the mouse UCP2 amino acid sequence 

25 (SEQ ID NO: 10) and the human UCP3 amino acid sequence (SEQ 
ID NO: 3); the attached sequence and amino acid alignments, 
mUCP.3 is 46% identical to mUCPl, 62% identical to mUCP2 but 
is 82% identical to hUCP3 . 

Figure 8 is a graphic representation v of the genomic 

3 0 organization of the human UCP3 gene, and shows the splice 
donor sequence (SEQ ID NO: 11) and splice acceptor sequence 
*(SEQ ID NO: 12) between exons 1 and 2, the splice donor 
sequence (SEQ ID NO: 13) and splice acceptor sequence (SEQ 
ID NO: 14) ^between exons 2 and 3, the splice donor sequence 

35 (SEQ ID NO: 15) and splice acceptor sequence (SEQ ID NO: 



• 
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16) between exons 3 and 4, the splice donor sequence (SEQ 
ID NO: 17) and splice acceptor sequence (SEQ ID NO: 18) 
between exons 4 and 5, the splice donor sequence (SEQ ID 
NO: 19) and splice acceptor sequence (SEQ ID NO: 20) 
5 between exons 5 and 6, and the splice donor sequence (SEQ 
ID NO: 21) and splice acceptor sequence (SEQ ID NO: 22) 
between exons 6 and 7 of the* UCP3 gene . 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an uncoupling protein 
10 (UCP3) gene which is selectively expressed in skeletal 
muscle and brown fat, two tissues involved in energy 
expenditure in mammals. In addition, the invention relates 
to an alternative form of UCP3 designated UCP3- short form 
(UCP3sh) , which is also expressed in skeletal muscle. As 
15 used herein, "UCP3" refers to UCP3 and UCP3sh. 

The present invention relates to isolated (e.g., 
purified, essentially pure) UCP3 gene which is involved in 
regulation of thermogenesis (energy expenditure) in 
mammals. In particular, the present invention relates to 
20 nucleic acids (e.g., DNA, RNA, oligonucleotides, 

polynucleotides) or characteristic portions thereof as 
described herein, obtained from natural sources, 
recombinantly produced or chemically synthesized which 
encode a mammalian UCP3 or functional portion thereof. 
25 Nucleic acids referred to herein as "isolated" are 

.nucleic acids substantially free of (separated away' from) 
the nucleic acids of the genomic DNA or cellular RNA of 
their biological source of origin (e.g., as it exists in 
cells or in a mixture of nucleic acids such as a library) , 
3 0 and may have undergone further processing. "Isolated" 
nucleic acids include nucleic acids obtained by methods 
described herein, similar methods or other suitable 
methods, including essentially pure nucleic acids, nucleic 
acids produced by chemical synthesis or by combinations of 
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biological and chemical methods, and recornbinantly produced 
nucleic acids which are isolated (see e.g., Daugherty, B.L. 
et al., Nucleic Acids Res., 19 (9) .-2471-2476 (1991); Lewis, 
A. P. and J.S. Crowe, Gene, 101: 297-302 (1991)). Nucleic 
5 acids referred to herein as "recombinant" are nucleic acids 
which have been produced by recombinant DNA methodologies 
(recornbinantly produced) . Recombinant DNA methodologies 
include, for example, expression of UCP3 in a host cell 
containing or modified to contain DNA or RNA encoding UCP3 
10 or expression of UCP3 using polymerase chain reaction (PCR) 
techniques , 

This invention includes characteristic portions of the 
nucleic acids described herein. As used herein, a 
"characteristic portion" of nucleic acids described herein 
15 refers to portions of a nucleotide sequence which encode a 
protein or polypeptide having at least one property, 
function or activity characteristic of UCP3 protein (e.g., 
predominantly expressed in brown adipose tissue and 
skeletal muscle; activity in regulating thermogenesis in 

2 0 skeletal muscle and brown adipose tissue; selectively 

uncoupling mitochondrial respiration in brown adipocytes 
and skeletal muscle) . In addition, the term includes a 
nucleotide sequence which, through the degeneracy of the 
genetic code, encodes the same peptide as a peptide whose 

25 sequence is presented herein (e.g., SEQ ID NO: 1, SEQ ID 
NO: 2, SEQ ID NO: 7). The nucleic acids described herein 
may also contain a modification of the molecule such that 
the resulting gene product is sufficiently similar to that 
encoded by the unmodified sequence that it has essentially 

30 the same activity as the unmodified sequence. An example 
of such a modification would be a "silent" codon 
* substitution or an amino acid substitution, for instance, 
substitution of one codon encoding a hydrophobic amino acid 
to another codon encoding the same hydrophobic amino acid 

3 5 or substitution of one acidic amino acid for another acidic 
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amino acid. See Ausubel, F.M., et al., Current Protocols 
in Molecular Biology, Greene Publ . Assoc. and Wiley - 
Interscience 1989. 

In one embodiment, the nucleic acid or characteristic 
5 portion thereof encodes a protein or polypeptide having at 
least one property, activity or function characteristic of 
a mammalian UCP3 (as defined herein) , such as activity or 
function characteristic of a mammalian UCP3* (as defined 
herein) , such as activity in regulation of thermogenesis in 
10 skeletal muscle and brown adipose tissue. 

The present invention also relates more specifically 
to isolated nucleic acids or a characteristic portion 
thereof, which encode mammalian UCP3 or variants thereof . 
The invention relates to isolated nucleic acids that: 
15 (i) hybridize to (a) a nucleic acid encoding a 

mammalian UCP3 (e.g., human), such as a nucleic acid having 
a nucleotide sequence as set forth or substantially as set 
forth in Figures 1A-1C (SEQ ID NO:l), Figures 2A-2B (SEQ ID 
NO: 2) or Figures 5A-5C (SEQ ID NO: 7); (b) the complement 
20 of the sequences of (a) ; or (c) characteristic portions of 
either of the foregoing (e.g., a portion comprising the 

open reading frame) ; 

(2) encode a protein or polypeptide having at least 
one property, activity of function characteristic of a UCP3 
25 protein (e.g., predominantly expressed in brown adipose 
tissue and skeletal muscle; activity in regulating 
thermogenesis in skeletal muscle and brown adipose tissue; 
selectively uncoupling mitochondrial respiration in brown 
adipocytes and skeletal muscle ) 
30 (3) encode a polypeptide having the amino acid 

sequence of a mammalian UCP3 (e.g., SEQ ID NO: 3, SEQ ID 
NO: 4, SEQ ID NO: 7); or 

(4) have a combination of these characteristics. 
In one embodiment, the nucleic acid shares at least 
35 about 75% nucleotide sequence similarity, and more 
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preferably, at least about 90% nucleotide sequence 
similarity, to the sequence shown in Figures 1A-1C (SEQ ID 
NO:l), Figures 2A-2B (SEQ ID NO: 2) or Figures 5A-5C (SEQ 
ID NO: 7) . 

5 Isolated nucleic acids meeting these criteria include 

nucleic acids having sequences identical to sequences of 
naturally occurring mammalian UCP3 or variants of the 
naturally occurring sequences which encode mammalian 
(human) UCP3 . Such variants include mutants differing by 
10 the addition, deletion or substitution of one or more 
residues, modified nucleic acids in which one or more 
residues are modified (e.g., DNA or RNA analogs), and 
mutants comprising one or more modified residues. 

Nucleic acids of the present invention may be RNA or 
15 DNA (e.g., cDNA, genomic DNA, and synthetic DNA). The DNA 
may be double -stranded or single-stranded and, if single 
stranded, may be the coding strand or non-coding (anti- 
sense) strand. The coding sequence which encodes the 
polypeptide may be identical to the coding sequence shown 
20 in Figures 1A-1C (SEQ ID N0:1), Figures 2A-2C (SEQ ID 

NO:2) , Figures 5A-5C (SEQ ID NO: 7) or may be a different 
coding sequence which, as a result of the redundancy or 
degeneracy of the genetic code, encodes the same 
polypeptide as the polypeptide encoded by the DNA of 
25 Figures 1A-1C (SEQ ID NO:l), Figures 2A-2B (SEQ ID NO:2) or 
Figures 5A-5C (SEQ ID NO: 7) . 

The nucleic acid (polynucleotide) which encodes a UCP3 
polypeptide encoded by the UCP3 cDNA may include: only the 
coding sequence of a polypeptide; the coding sequence for a 
3 0 polypeptide and additional coding sequence such as a leader 
or secretory sequence; the coding sequence for a 
polypeptide (and optionally additional coding sequence) and 
non- coding sequence, such as introns or non-coding sequence 
5' and/or 3' of the coding sequence- 
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Nucleic acids of the present invention, including 
those which hybridize to a selected nucleic acid as 
described above, can be detected or isolated under high 
stringency conditions or moderate stringency conditions, 
for example. "High stringency conditions" and "moderate 
stringency conditions" for nucleic acid hybridizations are 
explained at pages 2.10.1-2.10.16 (see particularly 2.10.8- 
11) and pages 6.3.1-6 in Current Protocols in Molecular 
Biology (Ausubel, P.M. et al. , eds., Vol. 1; Suppl. 26, 
1991) , the teachings of which are hereby incorporated by 
reference. Factors such as probe length, base composition, 
percent mismatch between the hybridizing sequences, 
temperature and ionic strength influence the stability of 
nucleic acid hybrids. Thus, high or moderate stringency 
conditions can be determined empirically, and depend in 
part upon the characteristics of the known nucleic acid 
(e.g., DNA) and the other nucleic acids to be assessed for 
hybridization thereto. 

Nucleic acids of the present invention that are 
characterized by their ability to hybridize (e.g., under 
high or moderate stringency conditions) to (a) a nucleic 
acid encoding a mammalian UCP3 (for example, the nucleic 
acid depicted in Figures 1A-1C (SEQ ID NO:l), Figures 2A-2B 
(SEQ ID N0:2), Figures 5A-5B (SEQ ID NO: 7) or 
25 characteristic portions thereof) ; (b) the complement of the 
nucleic acids of (a); or (c) a portion thereof, can also 
encode a protein or polypeptide having at least one 
property, activity or function characteristic of a 
mammalian UCP3 as defined herein, such as activity in 
30 regulation of thermogenesis in skeletal muscle and brown 

adipose tissue. In a preferred embodiment the nucleic acid 
encodes a polypeptide which retains substantially the same 
biological function or activity as the polypeptide encoded 
by the 'DNA of "Figures 1A-1C (SEQ ID UO-.n , or Figures 2A-2B 
35 (SEQ I'D N0:2) or Figures 5A-5C (SEQ ID NO: 7) . 



20 
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Nucleic acids of the present invention can be used in 
the production of proteins or polypeptides. For example, a 
nucleic acid (e.g., DNA) encoding a mammalian UCP3 can be 
incorporated into various constructs and vectors created 
5 for further manipulation of sequences or for production of 
the encoded polypeptide in suitable host cells as described 
above . 

A further embodiment of the invention is antisense 
nucleic acid, which is complementary, in whole or in part, 

10 to a UCP3 sense strand, and can hybridize with it. The 

antisense strand hybridizes to DNA, or its RNA counterpart 
(i.e., wherein T residues of the DNA are U residues in the 
RNA counterpart). When introduced into a cell, afntisense 
nucleic acid hybridizes to and inhibits the expression of 

15 the sense strand. Antisense nucleic acids can be produced 
by s t andar d t e chni que s . 

In another embodiment, the antisense nucleic acid is 
wholly or partially complementary to and can hybridize with 
a target nucleic acid which encodes a mammalian UCP3 . For 

20 example, antisense nucleic acid can be complementary to a 
target nucleic acid having the sequence shown as the open 
reading frame in Figures 1A-1C (SEQ ID N0:1), Figures 2A-2B 
(SEQ ID N0:2) , Figures 5A-5C (SEQ ID NO: 7) or to a portion 
thereof sufficient to allow hybridization. 

25 The nucleic acids can also be used as probes (e.g., 

for in situ hybridization) to assess regulation of 
thermogenesis in skeletal muscle and/or brown adipose 
tissue. The nucleic acids can also be used as probes to 
detect and/or isolate (e.g., by hybridization with RNA or 

3 0 DNA) polymorphic or allelic variants, for example, in a 
sample (e.g., skeletal muscle, brown adipocytes, .white 
.blood cells) obtained from a host (e.g., a human) . 
Moreover, the presence or level of a particular variant in 
a sample (s) obtained from an individual., as compared with 

35 the presence or level in a sample (s) from normal 
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mammalian UCP3 and variants 01 tnaramduan uu 
preferred embodiment, the isolated proteins of the present 
invention have at least one property, activity or function 
characteristic of a mammalian UCP3 (as defined herein) , 
such as activity in regulating (mediating) thermogenesis in 
skeletal muscle and brown adipose tissue or selectively 
uncoupling mitochondrial respiration in brown adipocytes 

and in skeletal muscle. 
15 proteins or polypeptides referred to herein as 

'•isolated" are proteins or polypeptides purified to a state 
beyond that in which they exist in mammalian cells. 
"Isolated" proteins or polypeptides include proteins or 
polypeptides obtained by methods described herein, similar 
20 methods or other suitable methods. They include 

essentially pure proteins or polypeptides, proteins or 
polypeptides produced by chemical synthesis (e.g., 
synthetic peptides) , or by combinations of biological and 
chemical methods, and recombinant proteins or polypeptides 
25 which are isolated. The proteins can be obtained in an 

isolated state of at least about 50 % by weight, preferably 
at least about 75 % by weight, and more preferably, in 
essentially pure- form. Proteins or polypeptides referred 
to herein as "recombinant" are proteins or polypeptides 
30 produced by the expression of recombinant nucleic acids. 

As used herein, "mammalian UCP3" protein ref,ers to 
naturally occurring or endogenous mammalian UCP3s, proteins 
having an amino acid sequence which is the same as that of 
a naturally occurring or endogenous corresponding mammalian 
35 UCP3 (e.g., recombinant proteins), and functional variants 
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of each of the foregoing (e.g., functional fragments and/or 
mutants produced via mutagenesis and/or recombinant 
techniques) . Accordingly, as defined here in, the term 
includes mammalian UCP3 , glycosylated or unglycosylated 
5 UCP3 , polymorphic or allelic variants, and other isoforms 
of mammalian UCP3 (e.g., produced by alternative splicing- 
or other cellular processes), and functional fragments. 

Naturally occurring or endogenous mammalian UCP3s 
include wild type proteins such as mammalian UCP3, 

10 polymorphic or allelic variants and other isoforms which 
occur naturally in mammals (e.g., primate, preferably 
human, murine, bovine) . Such proteins can be recovered 
from a source in which UCP3 is naturally produced, for 
example. These mammalian proteins have the same amino acid 

15 sequence as naturally occurring or endogenous corresponding 
mammalian UCP3 . 

"Functional variants" of mammalian UCP3 include 
functional fragments, functional mutant proteins, and/or 
functional fusion proteins. Generally, fragments or 

2 0 portions of mammalian UCP3 encompassed by the present 

invention include those having one or more amino acid 
deletions relative to the naturally occurring mammalian 
UCP3 protein (such as N- terminal, C- terminal or internal 
deletions) . Fragments or portions in which only contiguous 
25 amino acids have been deleted or in which non- contiguous 
amino acids have been deleted relative to naturally 
occurring mammalian UCP3 are also encompassed by the 
invention. 

Generally, mutants or derivatives of mammalian UCP3 , 

3 0 encompassed by the present invention include natural or 

artificial variants differing by the addition, deletion 
, and/or substitution of one or more contiguous or 
non -contiguous amino acid residues, or modified 
polypeptides in which one or more residues is modified, and 
35 mutants comprising one or more modified residues. For 
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example, mutants can be natural or artificial variants of 
mammalian UCP3 which differ from naturally occurring UCP3 
by the addition, deletion and/or substitution of one or 
more contiguous or non-contiguous amino acid residues. 
5 a "functional fragment or portion", "functional 

mutant" and/or "functional fusion protein" of a mammalian 
UCP3 refers to an isolated protein or oligopeptide which 
has at least one property, activity or function 
characteristic of a mammalian UCP3 , such as activity in 
10 regulating (mediating) thermogenesis in skeletal muscle and 
brown adipose tissue or activity in selectively uncoupling 
mitochondrial respiration in brown adipocytes and in 

skeletal muscle. 

Suitable fragments or mutants can be identified by 

15 screening. For example, the N- terminal, C-terminal, or 
internal regions of the protein can be deleted in a step- 
wise fashion and the resulting protein or polypeptide can 
be screened using a suitable assay, for example, by 
measuring mitochondrial membrane potential in a host cell 

20 expressing UCP3 . Where the resulting protein displays 

activity in the assay, the resulting protein ("fragment") 

is functional. 

The invention also encompasses fusion proteins, 
comprising a mammalian UCP3 as a first moiety, linked to a 
25 second moiety not occurring in the mammalian UCP3 found in 
nature. Thus, the second moiety can be, for example, an 
amino acid, oligopeptide or polypeptide. The first moiety 
can be in an N-terminal location, C-terminal location or 
internal location of the fusion protein. In one 
30 embodiment, the fusion protein comprises a mammalian UCP3 
or portion thereof as the first moiety, and a second moiety 
comprising an affinity ligand (e.g., an enzyme, an antigen, 
epitope tag) joined to the first moiety. Optionally, the 
two components can be joined "by a litikex. 
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Examples of "human UCP3 " include proteins having an 
amino acid sequence as set forth or substantially as set 
forth in Figure 3 (SEQ ID NO: 3, SEQ ID NO: 4) and 
functional portions thereof. An example of "mouse UCP3" 
5 includes a protein having an amino acid sequence as set 
forth or substantially set forth in Figure 6 (SEQ ID NO: 
8) In preferred embodiments, a human UCP3 protein, a 
mouse UCP3 protein or a variant thereof has an amino acid 
sequence which has at least about 75% identity, and more 
10 preferably at least about 90% identity, to the protein 

shown in Figure 3 (SEQ ID NO: 3, SEQ ID NO: 4) or Figure 6 

(SEQ ID NO: 8) . 

Another aspect of the invention relates to a method of 
producing a human UCP3 or variant (e.g., portion) thereof. 
15 Recombinant protein can be obtained, for example, by the 
expression of a recombinant DNA molecule encoding a 
mammalian UCP3 or variant thereof in a suitable host cell. 

Constructs suitable for the expression of a mammalian 
UCP3 or variant thereof are also provided. The constructs 
20 can be introduced into a suitable host cell, and cells 
which express a recombinant mammalian UCP3 or variant 
thereof, can be produced and maintained in culture. Such 
cells are useful for a variety of purposes, and can be used 
in the production of protein for characterization, 
25 isolation and/or purification, (e.g., affinity 

purification), and as immunogens, for instance. Suitable 
host cells can be procaryotic, including bacterial cells 
such as E. coli, B. subtilis and or other suitable bacteria 
(e.g., Streptococci) or eucaryotic, such as fungal or yeast 
30 cells (e.g., Pichia pas tori s Aspergillus species, 

Sac char omyces cerevisiae, Schizosaccharomyces pombe, 
Neurospora crassa) , or other lower eucaryotic cells, and 
cells of higher eucaryotes such as those from insects 
{e.g., Sf9 insect cells) or mammals (e.g., Chinese hamster 
35 ovary cells (CHO) , COS cells, HuT 78 cells, 293 cells). 
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(See, e.g., Ausubel, F.M. et al., eds. Current Protocols in 
Molecular Biology, Greene Publishing Associates and John 
Wiley & Sons Inc., (1993)). 

Host cells which produce a recombinant mammalian UCP3 
5 or variants thereof can be produced as follows. For 
example, nucleic acid encoding all or part of the UCP3 
protein or a functional portion thereof can be inserted 
into a nucleic acid vector, e.g., a DNA vector, such as a 
plasmid, virus or other suitable replicon for expression. 
10 A variety of vectors is available, including vectors which 
are maintained in single copy or multiple copy, or which 
become integrated into the host cell chromosome. 

The transcriptional and/or translational signals of a 
mammalian UCP3 gene can be used to direct expression. 
15 Alternatively, suitable expression vectors for the 

expression of a nucleic acid encoding all or part of the 
desired protein are available. Suitable expression vectors 
can contain a number of components, including, but not 
limited to, one or more of the following: an origin of 
20 replication; a selectable marker gene; one or more 

expression control elements, such as a transcriptional 
control element (e.g., a promoter, an enhancer, 
terminator), and/or one or more translation signals; a 
signal sequence or leader sequence for membrane targeting 
25 or secretion (of mammalian origin or from a heterologous 

mammal or non-mammalian species). In a construct, a signal 
sequence can be provided by the vector, the mammalian UCP3 
coding sequence, or other source. 

A promoter can be provided for expression in a 
3 0 suitable host cell. Promoters can be constitutive or 

inducible. The promoter is operably linked to nucleic acid 
encoding the mammalian UCP3 or variant thereof, and is 
capable of directing expression of the encoded polypeptide 
in the ' host cell . A variety of suitable promoters for 
35 procaryotic (e.g., lac, tac, T3, T7 promoters for E. coli) 
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and eucaryotic (e.g., yeast alcohol dehydrogenase (ADH1) , 
SV4 0, CMV) hosts is available. 

In addition, the expression vectors typically comprise 
a selectable marker for selection of host cells carrying 
5 the vector, and in the case of a replicable expression 
vector, also comprise an origin of replication. Genes 
encoding products which confer antibiotic or drug 
resistance are common selectable markers and may be used in 
procaryotic (e.g., /3-lactamase gene (ampicillin 

10 resistance) , Tet gene for tetracycline resistance) and 

eucaryotic cells (e.g., neomycin (G418 or geneticin) , gpt 
(mycophenolic acid) , ampicillin, or hygromycin resistance 
genes) . Dihydrof olate reductase marker genes permit 
selection with methotrexate in a variety of hosts. Genes 

15 encoding the gene product of auxotrophic markers of the 

host (e.g., LEU2 , URA3, HIS3) are often, used as selectable 
markers in yeast. Use of viral (e.g., baculovirus) or 
phage vectors, and vectors which are capable of integrating 
into the genome of the host cell, such as retroviral 

2 0 vectors, are also contemplated. The present invention also 
relates to cells carrying these expression vectors. 

.For example, a nucleic acid encoding a mammalian UCP3 
or variant thereof is incorporated into a vector, operably -* 
linked to one or more expression control elements, and the 

25 construct is introduced into host cells which are 

maintained under conditions suitable for expression, 
whereby the encoded polypeptide is produced. The construct 
can be introduced into cells by a method appropriate to the 
host cell selected (e.g., transformation, transf ection, 

30 electroporation, infection) . For production of a protein, 
host cells comprising the construct are maintained under 
conditions appropriate for expression, (e.g., in the 
presence of inducer, suitable media supplemented with 
appropriate salts, growth factors, antibiotic, nutritional 
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supplements, etc.)- The encoded protein (e.g., human UCP3) 
can be isolated from the host cells or medium. 

Fusion proteins can also be produced in this manner. 
For example, some embodiments can be produced by the 
5 insertion of a mammalian UCP3 cDNA or portion thereof into 
a suitable expression vector, such as Bluescript®II SK +/- 
(Stratagene) , pGEX-4T-2 (Pharmacia), pcDNA-3 (Invitrogen) 
and pET-15b (Novagen) . The resulting construct can then be 
introduced into a suitable host cell for expression. Upon 
10 expression, fusion protein can be isolated or purified from 
a cell lysate by means of a suitable affinity matrix (see 
e g Current Protocols in Molecular Biology (Ausubel , F.M. 
e'tal., eds., Vol. 2, Suppl . 26, pp. 16.4 .1-16.7 .;8 (1991))- 
in addition, affinity labels provide a means of detecting a 
15 fusion protein. For example, the cell surface expression 
or presence in a particular cell fraction of a fusion 
protein comprising an antigen or epitope affinity label can 
be detected by means of an appropriate antibody. 

The UCP3 nucleic acids (DNA, RNA) and protein can be 
20 used in a variety of ways. For example, UCP3 nucleic acids 
and proteins can be used to identify agents (e.g., 
molecules) that alter or modulate (enhance, inhibit) UCP3 
expression and/or function. For example, UCP3 can be 
expressed in a host cell and effects of test compounds on 
25 mitochondrial membrane potential in the host cell could be 
assessed. In addition, evaluation of mitochondrial 
respiration could also be performed in the host cell. 

In one embodiment, the present invention relates to a 
method of identifying an agent which alters UCP3 activity, 
30 wherein a nucleic acid construct comprising nucleic acid 
which encodes a mammalian UCP3 is introduced into, a host 
cell(s) . The host cells produced are maintained under 
conditions appropriate for expression of the encoded 
mammalian UCP3_, whereby the nucleic acid is expressed. The 
35 host cells are then contacted with a compound to be 
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assessed (an agent) and the mitochondrial electrical 
potential (mitochondrial membrane potential) of the cells 
is detected in the presence of the compound to be assessed. 
Detection of a change in mitochondrial electrical potential 
5 in the presence of the agent indicates that the agent 
alters UCP3 activity. In a particular embodiment, the 
invention relates to a method of identifying an agent which 
is an activator of UCP3 activity wherein a nucleic acid 
construct comprising nucleic acid which encodes a mammalian 
10 UCP3 is introduced into a host cell (s) . The host cells 
produced are maintained under conditions appropriate for 
expression of the encoded mammalian UCP3 , whereby the 
nucleic acid is expressed. The host cells are then 
contacted with a compound to be assessed (an agent) and the 
15 mitochondrial electrical potential of the cells is detected 
in the presence of the compound to be assessed. Detection 
of a decrease or reduction of mitochondrial electrical 
potential in the presence of the agent indicates that the 
agent activates UCP3 activity. In another embodiment, the 
2 0 invention relates to a method of identifying an agent which 
is an inhibitor of UCP3 activity, wherein a nucleic acid 
construct comprising nucleic acid which encodes a mammalian 
UCP3 is introduced into a host cell(s). The host cells 
produced are maintained under conditions appropriate for 
25 expression of the encoded mammalian UCP3 , whereby the 
nucleic acid is expressed. The host cells are then 
contacted with a compound to be assessed (an agent) and the 
mitochondrial electrical potential of the cells is detected 
in the presence of the compound to be assessed. Detection 
30 of an increase of mitochondrial electrical potential in the 
presence of the agent indicates that the agent irjthibits 
UCP3 activity. 

Detection of a change in mitochondrial electrical 
potential can be performed using a variety of techniques. 
35 For example, a change in mitochondrial electrical potential 
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can be detected by measuring fluorescence of recombinant 
cells expressing UCP3 . Decrease of fluorescence in the 
presence of the test compound, indicates a decrease of 
mitochondrial membrane potential (mitochondrial A*) , and 
vice versa for cases where fluorescence is increased. That 
is, increase of fluorescence in the presence of the test 
compound indicates an increase of mitochondrial A*. If 
decrease in fluorescence is observed in UCP3 expressing 
cells, but not in control cells, then the test compound is 
an activator of UCP3 . If an increase in fluorescence is 
observed in UCP3 expressing cells, but not in control 
cells, then the test compound is an inhibitor of UCP3 . 

In a particular embodiment, as described in ^Example 3, 
a high throughput screen can be used to identify .agents 
15 that activate (enhance) or inhibit UCP3 activity. For 
example, the method of identifying an agent which alters 
UCP3 activity can be performed as follows. A nucleic acid 
construct comprising nucleic acid which encodes a mammalian 
UCP3 is introduced into a host cell(s) to produce 
20 recombinant host cells. The recombinant host cells 

produced are maintained under conditions appropriate for 
expression of the encoded mammalian UCP3 , whereby the 
nucleic acid is expressed. A fluorescent dye and the 
compound to be assessed are added to the recombinant host 
25 cells; the resulting combination is referred to as a test 
sample. Fluorescence is detected. A decrease of 
fluorescence in the presence of the test compound occurs 
with a decrease in the mitochondrial electrical potential 
of the cells, which indicates that the agent is an 
activator of UCP3 . Conversely, an increase of fluorescence 
in the presence of the test compound occurs with an 
increase in the mitochondrial electrical potential of the 
cells, which indicates that the agent is an inhibitor of 
UCP3 . -Suitable dyes for use in this embodiment of the 
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invention include, for example, JC-1, rhodamine 123, 
DiOCc[3], or tetramethylhydros amine . 

A control can be used in the methods of detecting 
agents which alter UCP3 activity. For example, the control 
5 sample includes the same reagents but lacks the compound or 
agent being assessed; it is treated in the same manner as 

the test sample. 

Also encompassed by the present invention is an agent 
whici; interacts with UCP3 directly or indirectly, and 

0 inhibits or enhances UCP3 expression and/or function. In 
one embodiment, the agent is an inhibitor which, interferes 
with UCP3 directly (e.g., by binding UCP3) or indirectly 
(e.g., by blocking the ability of UCP3 to function in 
thermogenesis) . In a particular embodiment, arv inhibitor 

5 of UCP3 protein is an antibody specific for UCP3 protein or 
a functional portion of UCP3 ; that is, the antibody binds 
the UCP3 protein. For example, the antibody can be 
specific for the protein encoded by the amino acid sequence 
of human UCP3 (SEQ ID NO: 3), human UCP3sh (SEQ ID NO: 4), 

0 mouse UCP3 (SEQ ID NO: 8) or portions thereof. 

Alternatively, the inhibitor can be an agent other than an 
antibody (e.g., small organic molecule, protein or peptide) 
which binds UCP3 and blocks its activity. For example, the 
inhibitor can be an agent which mimics UCP3 structurally, 

15 but lacks its function. Alternatively, it can be an agent 
which binds to or interacts with a molecule which UCP3 
normally binds with or interacts with, thus blocking UCP3 
from doing so and preventing it from exerting the effects 
it would normally exert. 

5 0 In another embodiment, the agent is an enhancer 

(activator) of UCP3 which increases the activity .of UCP3 
(increases the effect of a given amount or level of UCP3) , 
increases the length of time it is effective (by preventing 
its degradation or otherwise prolonging the time during 

3 5 which it is active) or both either directly or indirectly. 
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For example, UCP3 nucleic acids and proteins can be used to 
identify ant i -obesity drugs which enhance UCP3 to induce 
uncoupling in brown fat and/or skeletal muscle, with* the 
result that stored energy is released as heat. 
5 In another embodiment, the sequences described herein 

can be used to detect UCP3 or DNA encoding UCP3 in a 
sample. For example, a labeled. nucleic acid probe having 
all or a functional portion of the nucleotide sequence of 
UCP3 can be used in a method to detect UCP3 in a sample. 
10 In one embodiment, the sample is treated to render the 

nucleic acids in the sample available for hybridization to 
a nucleic acid probe, which can be DNA or RNA. The 
resulting treated sample is combined with a labeled nucleic 
acid probe having all or a portion of the nucleotide 
15 sequence of UCP3 , under conditions appropriate for 
hybridization of complementary sequences to occur. 
Detection of hybridization of nucleic acids from the sample 
with the labeled nucleic probe indicates the presence of 
UCP3 in a sample. The presence of UCP3 mRNA is indicative 
20 of UCP3 expression. Such a method can be used, for 

example, as a screen for normal or abnormal thermogenesis 
in skeletal muscle or brown adipose tissue. 

Alternatively, a method of detecting UCP3 in a sample 
can be accomplished using an antibody directed against UCP3 
25 or a portion of UCP3 . Detection of specific binding to the 
antibody indicates the presence of UCP3 in the sample 
(e.g., ELISA) . This could reflect a pathological state 
associated with UCP3 and, thus, can be used diagnostically . 
The sample for use in the methods of the present 
3 0 invention includes a suitable sample from, for example, a 
mammal, particularly a human. For example, the sample can 
be blood, skeletal muscle or brown adipose tissue. 

• The UCP3 sequences of the present invention can also 
be used to generate nonhuman gene knockout animals, such as 
35 mice, which lack UCP3 and transgenically overexpress UCP3 . 
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For example, such UCP3 gene knockout mice can, be generated 
and used to obtain further insight into the function of 
UCP3 as well as assess the specificity of UCP3 activators 
and inhibitors. Also, overexpression of UCP3 (e.g., human 
5 UCP3) in transgenic mice can be used as a means of creating 
a test system for UCP3 activators and inhibitors (e.g., 
against human UCP3). In addition, the UCP3 gene can be 
used to clone the UCP3 promoter/enhancer in order to 
identify regulators of UCP3 transcription. UCP3 gene 
10 knockout animals include animals which completely or 
partially lack the UCP3 gene and/or UCP3 activity or 
function. 

As described herein, it is likely that UCP3 splays a 
role in controlling protein wasting and production of 

15 gluconeogenic precursors by skeletal muscle via transport 

of one or more metabolites, which indicates that inhibitors 
of UCP3 can be used as a means of curtailing muscle wasting 
due to, for example, infection, (e.g., human 
immunodeficiency virus) cancer, tumor cachexia, muscle 

20 diseases (e.g., muscular dystrophy) or as a possible 
treatment for non- insulin dependent diabetes mellitus 
(NIDDM) . 

Thus the present invention relates to a method of 
inhibiting (partially, completely) protein catabolism in a 

25 mammal (e.g., human) comprising administering to the mammal 
an effective amount of an inhibitor of UCP3 . The invention 
also relates to a method of enhancing protein catabolism in 
a mammal comprising administering to the mammal an 
effective amount of an enhancer UCP3 . Also encompassed by 

3 0 the present invention is a method of inhibiting muscle 

wasting in a mammal comprising administering an effective 
amount of an enhancer of UCP3 to the mammal . 

A number of studies have demonstrated that brown 
adipose tissue plays an important role in regulating energy 

35 balance in rodents (Himms-Hagen, J., Progr. Lipid -Res., 
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in brown adipocytes, UCP2, which is expressed widely 
(Fleury, C, et al., Nature Genetics, 15:269-272 (1997); 
Gimeno, R.E., et al., Diabetes, in press (1997)) and, as 
10 demonstrated herein, UCP3 which is expressed selectively 
and abundantly in brown adipocytes and skeletal muscle . 
These features make brown fat ideally suited to regulated 

thermogenesis. 

In contrast to rodents, brown adipose tissue in large 
15 mammals is relatively limited and therefore brown fat may 
not be a significant regulator of human energy expenditure. 
A number of studies in humans have implicated skeletal 
muscle as an important mediator of adaptive thermogenesis 
in humans (Astrup, A. , et al., Am. J. Physiol., 248:E507- 
20 515 (1985); Astrup, A., et al., Am. J. Physiol., 257:E340- 
345 (1989); Zurlo, F., et al., J. Clin. Invest., 8S:1423- 
1427- (1990); Simonsen, L. , et al., Am. J. Physiol., 
263-.E850-855 (1992); Spraul, M. , etal., J. Clin. Invest., 
92:1730-1735 (1993)). Approximately 80% of the variance in 
25 resting energy expenditure between individuals can be 
accounted for by differences in fat-free mass (Ravussin, 
E., et al., Am. J. Clin. Nutr., 55: 242S-245S (1992)), much 
of which is skeletal muscle. Similarly, a perfused forearm 
study has demonstrated that differences in skeletal muscle 
30 energy expenditure account for much of the variation in 
metabolic rate observed between individuals (Zurlo, F., et 
al., J. Clin. Invest., 86:1423-1427 (1990)). Regulated 
energy expenditure in skeletal muscle is controlled, in 
large part, by sympathetic stimulation ((Astrup, A., et 
35 al., Am. J. Physiol., 248:E507-515 (1985); Astrup, A., et 
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al., Am. J". Physiol., 257-.E340-345 (1989); Simonsen, L., et 
al., Am. J. Physiol., 263:E850-855 (1992); Spraul , M. , et 
al., J". Clin. Invest., 52:1730-1735 (1993)). It is 
interesting to note that brown fat and skeletal muscle have 
5 many features in common: a rich blood supply, a dense 
sympathetic innervation, and abundant mitochondria. In 
addition, both tissues express high levels of UCP3 mRNA. 

The heart continuously expends large amounts of energy 
in order to maintain blood circulation. In view of thi3, 

10 it is probably significant that UCP3 is minimally expressed 
in cardiac tissue. This is especially true given the 
general tendency for non-contractile muscle-specific genes 
to be expressed in both striated muscle types (skeletal and 
cardiac) . Abundant expression of UCP3 in two thermogenic 

15 tissues, skeletal muscle and brown fat, and relative lack 
of expression in other sites such as the heart, 
demonstrates that UCP3 is an important molecular mediator 
of adaptive thermogenesis . 

Thus, the present invention provides for anti -obesity 

20 drug development wherein the UCP3 nucleic acids and protein 
can be used to identify, for example, enhancers 
(activators) of UCP3 which can be used to induce 
uncoupling. 0 3 -adrenergic receptor agonists, which 
increase UCP1 expression and activity in brown fat are 

25 presently under development, but may have limited effects 
given the paucity of brown fat in humans. UCP2 is another 
potential target. However, it is expressed in a number of 
critical organs and tissues and its activation could 
produce unwanted side effects. Specific activators of UCP3 

30 expression and/or function, on the other hand, selectively 
increase energy expenditure in skeletal muscle and brown 
fat, two tissues that have the capacity for adaptive energy 
expenditure . 
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The present invention is further illustrated by the 
following examples, which are not intended to be limiting 
in any way. 

EXAMPLE 1 CLONING AND CHARACTERIZATION OF THE UCP3 GENE 

Northern Blot Assays 

Human Multiple Tissue Northern Blots (#7760-1. #7759-1 
and #.7767-1) containing approximately 2 M 9 of polyA RNA per 
lane were purchased from Clontech Laboratories (Palo Alto, 
CA) All hybridizations, membranes washes and membrane 
strippings were performed according to manufacturer's 
specifications. The blots were first hybridized to a hUCP3 
probe, washed and exposed to film for 1-18 hours, then 
stripped, rehybridized to a hUCP2 probe and exposed to film 
for 18 hours. The hUCP3 probe was a 293 bp fragment 
corresponding to residues #211-308. The hUCP2 probe was a 
1125 bp fragment spanning the entire open reading frame. 
The specific activities of both hybridization probes were 
similar. Mouse Northern blots were generated using total 
RNA isolated from a number of tissues and equal loading of 
lanes was established using ethidium bromide florescence. 
The mouse Northern blots were hybridized using the hUCP3 
probe described above. 

RNase Protection 

Total RNA was extracted from adipose tissue the method 
; of Chomczynski and Sacchi (Chomczynski, P. and Sacchi, N. , 
Anal. Biochem., 162:156-159 (1987)). Skeletal muscle and 
heart RNA was obtained from Clontech. Aliquot s of 1, 3,5 
and 10 fig of adipose tissue and skeletal muscle RNA and 10 
/ig aliquot of heart RNA were used for determination of UCP3 
and mRNA levels. The Rnase protection assay was performed 
as previously described {Vidal-Puig^ A.,, et al.^ J. Clin. 
Invest., 97:2553-2561 (1997)). A UCP-3 cDNA fragment was 
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generated by reverse transcriptase- PCR using total RNA from 
human muscle as follows: two primers (5'GGA CTA CCA CCT GCT 
CAC TG 3' (SEQ ID NO : 23) and 5' CCC GTA ACA TAT GGA CTT 
T3 ' (SEQ ID NO: 24)) were designed to amplify 302 bp of the 
5 hUCP-3 sequence corresponding to residues #209-308. The 
PCR product was subcloned into PGMT easy TA cloning vector 
(Promega Corp., Madison, WI) and linearized for riboprobe 
synthesis using Spe I. Identity and orientation of the 
UCP3 probe was confirmed by sequencing. The antisense 
10 [ 32 P] -labeled UCP3 template was synthesized using T& RNA 
polymerase. A human cyclin riboprobe was used as an 
internal control (Ambion, Inc., Austin, TX) . 

Results 

As described herein, a third uncoupling homologue 

15 designated UCP3 has been cloned. It is distinguished from 
UCP1 and UCP2 by its selective expression in skeletal 
muscle and brown adipose tissue, two important sites for 
regulated energy expenditure in humans (Astrup, A. , et al., 
Am. J. Physiol., 248:E507-515 (1985); Astrup, A., et al . , 

20 Am. J. Physiol., 257:E340-345 (1989); Zurlo, F., et al . , J . 
Clin. Invest., 86:1423-1427 (1990); Simonsen, L . , et al . , 
Am. J. Physiol., 263:E850-855 (1992); Spraul , M - , et al . , 
J. Clin. Invest., 92:1730-1735 (1993)) and rodents (Himms- 
Hagen, J . , Progr. Lipid Res . , 28:67-115 (1989)). At the 

25 amino acid level, hUCP3 is 71% identical to hUCP2 and 57% 
identical to hUCPl . Because UCP3 is abundantly and 
selectively expressed in skeletal muscle and brown adipose 
tissue, UCP3 is likely to be an important mediator of 
regulated thermogenesis in humans . Since UCP3 is minimally 

3 0 expressed in heart and other critical organs, it .is a 

.promising target for anti-obesity drug development aimed at 
increasing thermogenesis. 
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The expressed sequence tag (EST) database 
(http://www.ncbi.nlm.gov) was screened for sequences 
homologous to UCP1. One human EST, deposited by the • 
Washington University, St. Louis - Merck & Co. EST project, 
was identified which was similar but not identical to hUCPl 
and hUCP2 (accession no. AA192136, IMAGE clone no. 628529). 
This clone originated from a human skeletal muscle cDNA 
library (#937209, Stratagene, La Jolla, CA)-. The bacterial 
stock for clone 628529 was obtained from Genome Systems 
(St. Louis, MI) and was found to contain an insert of 
approximately 1.3kb f which included the C- terminal third of 
the open reading frame . The coding region within clone 
628529 was fully resequenced. Full-length cDNA sequences 
were generated using the Marathon cDNA Amplification Kit, 
human skeletal muscle Marathon- Ready cDNA (both from 
Clontech Laboratories, Palo Alto, CA) and an antisense 
primer (5» -TTC ACC ACG TCC ACC CGG GGG GAT GCC ACC-3-' ) (SEQ 
ID NO: 25) corresponding to the coding sequence presumed to 

represent hUCP3 . 

UCP3 cDNA sequence contains a 5' untranslated region 

of at least 183 bases, an open reading from of 936 bases, a 
3' untranslated region of approximately 1.1 kb, a 
polyadenylation signal and a polyA tail (Figures 1A-1C) . 
The UCP3 mRNA transcript is predicted to be equal to or 
25 greater than 2.2 kb. UCP3 protein, as deduced from the 
open reading frame, is composed of 312 amino acids and is 
estimated to have a molecular weight of approximately 34 kD 
(Figure 3) . As shown in Figure 3, at the amino acid level, 
hUCP3 is 71% identical to hUCP2 and 57% identical to hUCPl; 
30 and hUCP2 is 59% identical to hUCPl . Many of the 
nonidentical residues in hUCP3 are conservative , 
substitutions which in most cases correspond to residues 
found in either mUCP2 (Fleury, C. , et al., Nature Genetics, 
15:269-212 (1997) j Gimenc R.E... et al., Diabetes, 46:900- 
35 906 (1997)) or in UCP1 from various species (Klaus, S., et 
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al., Jilt. J- Biochem., 23:791-801 (1991)). The data, based 
upon the high degree of homology between UCP1, UCP2 and 
UCP3 , demonstrates that UCP3 uncouples mitochondrial 
respiration. 

5 In order to establish the tissue distribution of UCP3 

in humans, Northern blot analyses were performed. UCP3 was 
abundantly expressed in skeletal muscle, generating a 
dominant mRNA transcript of approximately 2.4 kb. With 
longer exposure (18 hours), a much weaker UCP3 signal (2.4 
10 kb) was detected in a large number of other tissues and 

organs. The longer exposures (18 hours) of the human UCP3 
Northern blots also revealed the presence of a smaller mRNA 
transcript which had a similar size (approximately 1.6 kb) . 
Of note, the 294 bp hUCP3 probe employed was 7-5% identical 
15 to hUCP2. Rehybridization of the same blots with hUCP2 
confirmed that this smaller 1 . 6 kb signal was UCP2 . The 
UCP2 signal, as previously reported (Fleury, C. , et al . , 
Nature Genetics, 15:269-212 (1997); Gimeno, R.E. , et al . , 
Diabetes, 445:900-906 (1997)) was widely expressed. It was 
20 being most abundant in spleen, thymus, bone marrow, 

trachea, and lymph node, and somewhat less abundant in 
skeletal muscle as well as a number of other tissues. UCP2 
was also abundantly expressed in white adipose tissue as 
reported Gimeno, R.E., et al . , Diabetes, 445:900-906 
25 (1997)). A comparison of hybridization signals for UCP2 

and UCP3 suggests that UCP3 may be the dominant uncoupling 
protein transcript in human skeletal muscle. 

A sensitive RNase protection assay was used to assess 
UCP3 mRNA expression in heart, skeletal muscle and white 
30 adipose tissue. No UCP3 signal could be detected in white 
adipose tissue. In heart, a very weak UCP3 signal was 
. detected. The signal in heart was less than 1% of that 
detected in skeletal muscle. 

Tn mice, abundant TJCP3 expression was detected In 
35 skeletal muscle and brown fat. As with humans, little or 



no UCP3 expression was, 

as white adipose tissue, brain, kidney, liver and colon. 

As „as observed in the human mHNA studies , a smaller 
transcript was detected in mouse samples as well This 
smaller transcript most likely represents mUCP2 g.ven that 
it was most abundant in white adipose tissue, a site o£ 
high-level UCP2 expression (Fleury, C, et al., Mature 
Genetics, 15:269-272 (1997); Gimeno, R.E., et al., 
Diabetes, in press (1997), . 0£ note, the hUCP3 probe » 

73% identical to mUCP2. 

Figure 4 is a hydrophilicity plot o£ human UCP2 and 
human UCP3 showing the hydrophobicity o£ protein across 
linear sequence. 

EXAMPLE 2 Discovery - of an Alternative Form of UCP3, 

Designated UCP3 -short form (UCP3sh) 

As discussed above, the genomic organization of the 
human UCP3 gene has been defined. In addition, it has been 
determined that the UCP3 gene generates two mRNA 
transcripts, UCP3 and UCP3 -short form (UCP3sh) . The 
, nucleotide sequence of UCP3sh mRNA is shown in Figures 2A- 
2B The UCP3sh transcript encodes a shortened version of 
the UCP3 protein. As shown in Figure 8, the UCP3sh 
transcript results when a polyadenylation/transcription 
termination signal (AATAAA) (SEQ ID NO: 26) located within 
5 intron 6 terminates transcription (see Figure 3). However, 
this AATAAA (SEQ ID NO: 26) seems to be only partially 
effective in terminating transcription. When it does 
succeed in terminating transcription, the UCP3sh transcript 
is generated. When it fails to terminate transcription, 
,0 transcription continues on through exon 7 and terminates at 
the exon 7 AATAAA (SEQ ID NO: 26) signal. Splicing between 
exon £ ' and exon 7 then occurs to generate the UCP3 
transcript . 
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As shown is Figure 3, UCP3sh differs from UCP3 only by 
the absence of the last 37 amino acids. It is reasonable 
to expect that this is significant, since the region 
missing in UCP3sh is highly homologous to a region in UCP1 
5 which has been implicated in mediating inhibition of 

uncoupling activity by purine nucleotides (Murdza- Inglis , 
D.L., et al., J Biol Chem. 269: 7435-7438 (1994) ) , As a 
result, it is reasonable to expect that UCP3sh is more 
active as an uncoupler than UCP3 . 
10 Using a quantitative RNase protection assay similar to 

that described in Example 1, it was determined that UCP3sh 
mRNA, like UCP3 mRNA is extremely abundant in human 
skeletal muscle. In normal individuals, the lev^l of 
UCP3sh mRNA is skeletal muscle is equal to or' greater that 
15 the level of UCP3 mRNA. Preliminary studies have indicated 
that UCP3sh mRNA levels are reduced in obese individuals 
compared to lean individual. In contrast, UCP3 mRNA levels 
seem to be unchanged in obese individuals. These 
preliminary findings raise the possibility that UCP3sh is 
2 0 the more important UCP3 protein for body weight regulation. 



EXAMPLE 3 Cloning of mouse UCP3 gene 



Using the human UCP3 gene, the mouse UCP3 gene was 
isolated using methods similar to those described in 
Example 1. The mouse UCP3 nucleotide sequence (SEQ ID NO: 
25 7) is shown in Figures 5A-5C, and the mouse UPC3 amino acid 
sequence is shown in Figure 6. Comparisions of mUCP3 
versus mUCPl and mUP2 and human UCP3 are shown in Figure 7. 

EXAMPLE 4 Monitoring of JC-1 fluorescence in living cells 

An assay which utilizes f ibroblast-like cells lines 
30 expressing recombinant human UCP3 , and a fluorescent dye 
(e.g., JC-1) makes it possible to rapidly assess 
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mit ochondrial membrane potential (A*) xn ^^^^x- 
(Smiley, S.T.. et al., Proc. Natl. Acad. Sex aSA £3671 
3675 (1991); Reers, M., et al.. Methods an Enzymology, 
260-406-417 (1995)) - Any drug which increases UCP3 

5 activity is expected to reduce and therefore, reduce 
-red" -fluorescence of JC-1. By comparing effects of test 
compounds on fluorescence in a cell line expressing UCP3 
with a control (e.g., cells which do not express UCP3 ; 
cells which express UCP3 in the absence of the test 

L0 compound) , it is possible to identify specific activator 
and inhibitors of UCP3 . The cells can be grown m 96 well 
plates, and the plates can be read directly m a 
fluorometer designed to handle 96 well plates, it is 
possible to perform this assay in a high- throughput 

15 fashion. 

Recombinant cells expressing hUCP3 and cells not 
expressing UCP3 are grown in 96 well plates. On the day of 
analysis, the plates are rinsed and JC-1 dye is added to 
all wells plus or minus test compounds. Later, plates are 
20 washed and then, in the presence of the test compound, 

fluorescence is determined in a fluorometer. Decrease of 
fluorescence in the presence of the test compound, 
indicates a decrease of mitochondrial A* (and vice versa 
for cases where fluorescence is increased) . That is, 
25 increase of fluorescence in the presence of the test 

compound indicates an increase of mitochondrial A*. If 
decrease in fluorescence is observed in UCP3 expressing 
cells but not in control cells, then the test compound xs 
an activator of UCP3 . If an increase in fluorescence xs 
30 observed in UCP3 expressing cells, but not in control 
cells then the test compound is an inhibitor of .UCP3 . 

Any dye can be used in the high- throughout screen, 
such as JC-1, rhodamine 123, DiOCc[3], or 
tetramethylhydrosamine . in a particular embodiment, JC-1 
35 dye a delocalized lipophilic cation (DLC) , can be used. 
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The distinguishing feature of DLCs is that they are 
positively charged, yet lipophilic. The lipophilic feature 
allows then to traverse membranes and the positive charge 
causes then to accumulate within mitochondria {negatively 
5 charged on the inside) . This accumulation is proportional 
to A^, the membrane electrical potential across the inner 
mitochondrial membrane, and follows the Nernst Equation 
shown below. The mitochondrial A*' results from the protein 
electrochemical gradient across the inner mitochondrial 
10 membrane and represents the electrical portion of this 
gradient (ApH represents the chemical portion of the 
gradient) . 

A* = -60 log F in /F oul F = concentration of DLC 

Thus, a A*' of -60 mV corresponds to a DLC in/out ratio of 
15 10 to 1, and a AS' of -120 mV, corresponds to a DLC in/out 
ratio of 100 to 1. Thus, a change in AM' is amplified by a 
change in F in /F out . Of note, A*- for most mitochondrial range 
between -50 mV and -16 0 mV. 

Protonophore uncouplers such as DNP (dinitrophenol) , 
20 CCCP • (carbonyl cyanide m- chlorophenyl lhydrazone ) , decrease 
A* and, as a result, markedly decrease the accumulation of 
JC-1. Any drug which increases UCP activity is expected to 
have the same effect as DNP, CCCP or FCCP. 

JC-1 has fluorescent features which makes it extremely 
25 useful as a monitor of mitochondrial Mr. Many dyes 
aggregate at high concentrations and this reduces 
fluorescence greatly (for example, rhodamine 123) . 
Aggregates of JC-1 fluoresce intensely,- and at higher 
wavelength than JC-1 monomers. Specifically, monomers emit 
30 * at 527 nM (green) while J-aggregates emit at 590 nM (red) . 
Thus, high concentrations of JC-1 accumulate in 
mitochondria permitting the formation of aggregates. The 



without the uncoupler, tLu>, w ,» , 
times, trypsinized and then transferred as a cell 

eimtoal cm quartz cuvette, in which fluorescence 
10 suspension to a i cm yua 

was monitored using a Kontron SFH25 fluorescent 
spectrophotometer. 




Fluorescence 



(in arbitrary units) 

520 nM 
(green) 



590 nM 
(red) 



CX-1 cells 



CX-1 cells + FCCP 





10 



The data shows that JC-1 aggregate fluorescence can be 
monitored in living cells and that an uncoupler (FCCP) 
which is expected to have the same effect as a UCP 
activator markedly lowers "red" fluorescence. Fluorescence 
can also be monitored using a FACScan flow cytometer or in 
a single cell using fluorescence microscopy. 

EXAMPLE 5 UCP3 GENE EXPRESSION: Tissue Distribution and 

Physiologic Regulation 

Tissue Distribution - In humans, UCP3 is expressed 
abundantly and preferentially in skeletal muscle. In rats, 
UCP3 is expressed abundantly in skeletal muscle and brown 

fat. 

Starvation - UCP3 was dramatically increased "ty 
starvation in mice and rats (-5-10 fold). In humans, it 
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has been shown that 5 days of food restriction causes a 
2. 5- fold increase in UCP3 mRNA expression. Also, it was 
found that human UCP3 mRNA is significantly upregulated 
when transgenic mice bearing a human UCP3 PI clone are 
5 starved. Thus, it is likely that humans, like rodents, 
increase UCP3 gene expression with starvation. 

Role of FFAs - Recently, it was shown that treatment 
of fed rats with Intralipid plus heparin (which produced an 
increase in free fatty acids (FFAs) from 0.26 to 2.04 mM) 

10 caused a 3- fold increase in UCP3 (Weigle D.S., Diabetics, 
47:298-302 (1998)). Based upon this observation, it was 
suggested that the increase in FFAs with starvation was 
responsible for the effects of starvation on UCP3I mRNA 
levels. It was speculated that "this induction of UCP3 may 

15 be linked to the utilization of free fatty acids as a 

fuel". As discussed below however, it is unlikely that 
this hypothesis is true. 

Starvation plus Nicotinic Acid - 1 day fasted rats 
were treated with saline or nicotinic acid for 6 hours and 

2 0 the effects on UCP3 gene expression were assessed. 

Starvation increases lipolysis in adipose tissue, causing a 
marked increase in blood levels of FFAs. The increase in 
FFAs is thought to promote conservation of protein in 
skeletal muscle (when lipid fuels are abundant, the 
25 requirement for gluconeogenesis from muscle protein is 

reduced) . Nicotinic acid inhibits lipolysis, restores FFA 
levels to fed values, and stimulates protein catabolism in 
skeletal muscle (Lowell and Goodman, Diabetics, 36:14-19 
(1987) ) . The experiment described herein shows that 

3 0 nicotinic acid treatment of fasted animals returned FFA 

levels to fed values, but increased UCP3 mRNA to .levels 2- 
» fold higher than those observed in saline treated fasted 
controls. This observation shows that the starvation- 
induced rise in FFAs is not responsible fox the effects "of 
35 starvation on UCP3 mRNA levels. Also, it shows that UCP3 
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is not linked to the utilization of FFAs as fuel. Instead, 
based upon this finding it is reasonable to expect that 
UCP3 is linked to protein catabolism in skeletal muscle. 

Streptozotocin Diabetes - Fourteen days of 
streptozotocin diabetes in rats produced a very large 
increase in UCP3 mRNA levels. This rise in UCP3 was 
reversed with one day of insulin treatment. Streptozotocin 
diabetes is associated with significant protein catabolism 

in skeletal muscle. 

Endotoxin - Endotoxin treatment of rats and mice 
resulted in a very large increase in UCP3 mRNA levels in 
skeletal muscle, but not in other tissues. Endotoxin is a 
well known stimulator of protein catabolism in skeletal 
muscle. 

Dexamethasone - High dose dexamethasone treatment 
markedly stimulated UCP3 mRNA levels in skeletal muscle, 
but not in other tissues. Dexamethasone is also a well 
known stimulator of protein catabolism in skeletal muscle. 
Thyroid Hormone - High dose thyroid treatment in rats 
0 stimulated UCP3 mRNA levels. Thyroid hormones seemed to 
have little or no effect in mice. Thyroid hormone is also 
a well known stimulator of protein catabolism in skeletal 
muscle . 

ob/ob and db/db mice: fa/ fa rats - These genetically 
25 obese rodents were generated and shown to have markedly 
increased UCP3 mRNA levels in skeletal muscle. It is 
likely that increased UCP3 mRNA levels in ob/ob mice 
contributed to elevated production of gluconeogenic 
precursors by muscle, thereby promoting non-insulin 
30 dependent diabetes mellitus (NIDDM) in these animals. 

It is interesting to note that nearly all pqsitive 
regulators of UCP3 gene expression (starvation, nicotinic 
acid treatment during starvation, streptozotocin diabetes, 
endotoxin, dexamethasone and thyroid hormone) are 
35 associated with catabolism of skeletal muscle protein Isee 
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Mitch and Goldberg, NEJM, 335:1897-1905 (1996)). The only 
exceptions to this are genetically obese rodents (however, 
these animals do have decreased muscle mass) . From another 
perspective, it is also true that all catabolic states 
5 tested to date are associated with increased UCP3 
expression. 

Given that increased UCP3 gene expressions is linked 
to states of augmented skeletal muscle protein catabolism, 
it is likely that UCP3 plays an important role in 
10 regulating skeletal muscle protein catabolism (conversion 
of muscle protein to gluconeogenic precursors) . Possible 
mechanisms by which UCP3 plays a role are the following: 

a) UCP3 is a mitochondrial carrier which transports 
biosynthetic metabolites in and out of mitochondria 

15 during skeletal muscle protein catabolism (i.e., 

conversion of aspartate, glutamate, valine, isoleucine 
and leucine to gluconeogenic precursors alanine and 
glut amine) . 

b) UCP3 is the aspartate/glutamate carrier and is rate 

20 limiting for operation of the aspartate/malate shuttle 

, (transfers cytosolic NADH into the mitochondria) . 
Increased operation of this shuttle would reduce the 
cytosolic NADH /NAD ratio. It has been suggested that 
the cytosolic NADH/NAD ratio regulates muscle protein 

25 catabolism. 

c) UCP3 is indeed a genuine uncoupling protein and 
increased UCP3 activity in catabolic states oxidizes 
the whole cell redox state (NADH/NAD ratio) , thereby 
stimulating protein catabolism and amino acid 

30 metabolism. 

Skeletal Muscle Metabolism During Starvation- (and other 
cataboT'ic states") . 



significantly metabolized inside the myocytes prior to 
their release into the bloodstream. Alanine and glutamme 
XO represent approximately 12% amino acids in muscle protein 
but together represent > 50% of amino acids released by 
muS cle during starvation. Thus, much of the alanine and 
glutamine released must be synthesized. In contrast, 
aspartate, asparginine, glutamate, leucine, isoleucine and 
15 valine represent > 30% of amino acids in muscle protein but 
are released in only small amounts during starvation. These 
amino acids are interconverted to alanine and glutamine by 
muscle. Other amino acids such as glycine, cysteine, 
serine, threonine, methionine, proline, lysine, argimne, 
20 histidine, phenylalanine, tyrosine and tryptophan 

represents approximately 50% of muscle protein and are 
released either unchanged or as deaminated a -ketoacids. 

Alanine is generated by the transamination of 
pyruvate. The pyruvate (i.e., carbon) for alanine 
25 synthesis come from glycolysis while the nitrogen 

originates from aspartate, asparginine, glutamate, leucine, 
. isoleucine and valine. The released alanine is taken up by 
the liver and used to synthesize glucose. The glucose is 
then returned to the muscle and is metabolized into 
30 pyruvate, thus completing the glucose-alanine cycle. It is 
important to note that no new glucose is synthesized by 
this process, the carbons are simply recycled. Thus, the 
glucose-alanine cycle functions to conserve carbohydrate, 
but doe's not generate new carbohydrate. The cycle also 
35 functions to transfer NH2 from amino acids with are 
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metabolized (aspartate, asparginine, glutamate, leucine, 
isoleucine and valine) to the liver where it can be 
detoxified via the urea cycle. 

Because certain tissues are oxidizing glucose to C0 2 
5 (i.e., the brain), new glucose must be synthesized during 
starvation. This new glucose is synthesized from 
glutamine, which is released by muscle. The carbon 
backbone for glutamine comes from aspartate, asparginine, 
glutamate, isoleucine and valine, while the nitrogen comes 
10 from these same amino acids plus leucine. The leucine 
carbon backbone is completely oxidized to C0 2 by muscle. 
The glutamine released by muscle is taken up by the kidney 
and intestines, where a complex pathway is initiated 
culminating in the synthesis of glucose. Glutamine 
15 synthetase is the enzyme which converts glutamate to 

glutamine, the final step in glutamine synthesis. It is 
interesting to note that glutamine synthetase gene 
expression in muscle, like UCP3 gene expression, is 
induced by starvation, streptozotocin diabetes, endotoxin 
20 treatment and dexamethasone . It is also interesting to 
note, as was seen with UCP3 , that these effects on 
glutamine synthetase gene expression are observed in 
skeletal muscle, but not in other tissues. 

Significant mitochondrial metabolism must occur in 
25 order for aspartate, asparginine, glutamate, isoleucine, 
valine and leucine to be interconverted to alanine and 
glutamate. This is because important enzymes involved in 
the interconversion are located within the mitochondrial 
matrix. One example is branched chain a-ketoacid 
3 0 dehydrogenase (BCKADH) , an enzyme which initiates the 

oxidation of leucine, isoleucine and valine. Of .interest, 
* BCKADH activity in muscle increases significantly during 
starvation and streptozotocin diabetes. Thus, metabolites 
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mus t flux in and out of mitochondria for muscle to release 
alanine and glutamine during catabolic states . 



EQUIVALENTS ^ coa nize, or be able to 

Those skilled in the art will recognize, 

routine experimentation, many 
„„ ort ..i n using no more than routine <=^f 

"nts to"*, specific °\2£T™ 
described herein. Such equivalents are mtended to 
encompassed by the following claims. 
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SEQUENCE LISTING 

(1) GENERAL. INFORMATION: 
(i) APPLICANT: 



(A) 


NAME : 


Beth Israel Deaconess Medical Center 


(B) 


STREET : 


330 Brookline Avenue 


(C) 


CITY: 


Boston 


(D) 


STATE/ PROVINCE : 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION : /desc = "DNA 

■ 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
.CGAGGGGCC ATCCAATCCC TGCTGCCACC TCCTGGGATG GAGCCCTAGG GAGCCCCTGT 
CCTGCCCCTG CCGTGGCAGG ACTCACAGCC CCACCGCTGC ACTGAAGCCC AGGGCTGTGG 
AGCAGCCTCT CTCCTTGGAC CTCCTCTGGG CCCTAAAGGG ACTGGGCAGA GCCTTCCAGG 
ACTATGGTTG GACTGAAGCC TTCAGACGTC CCTCCCACCA TGGCTGTGAA GTTCCTGGGG 
GCAGGCACAG CAGCCTGTTT TGCTGAACTC GTTACCTTTC CACTGGACAC AGCCAAGGTC 
CGCCTGCAGA TCCAGGGGGA GAACCAGGCG GTCCAGACGG CCCGGCTCGT GCAGTACCGT 
GGCGTGCTGG GCACCATCCT GACCATGGTG CGGACTGAGG GTCCCTGCAG CCCCTACAAT 
GGGCTGGTGG CCGGCCTGCA GCGCCAGATG AGCTTCGCCT CCATCCGCAT CGGCCTCTAT 
CACTCCGTCA AGCAGGTGTA CACCCCCAAA GGCGCGGACA ACTCCAGCCT CACTACCCGG 
ATTTTGGCCG GCTGCACCAC AGGAGCCATG GCGGTGACCT GXGCCCAGCC CACAGATGTG 
GTGAAGGTCC GATTTCAGGC CAGCATACAC CTCGGGCCAT CCAGGAGCGA CAGAAAATAC 
AGCGGGACTA TGGACGCCTA CAGAACCATC GCCAGGGAGG AAGGAGTCAG GGGCCTGTGG 
AAAGGAACTT TGCCCAACAT CATGAGGAAT GCTATCGTCA ACTGTGCTGA GGTGGTGACC 
TACGACATCC TCAAGGAGAA GCTGCTGGAC TACCACCTGC TCACTGACAA CTTCCCCTGC 
CACTTTGTCT CTGCCTTTGG AGCCGGCTTC TGTGCCACAG TGGTGGCCTC CCCGGTGGAC 
GTGGTGAAGA CCCGGTATAT GAACTCACCT CCAGGCCAGT ACTTCAGCCC CCTCGACTGT 
ATGATAAAGA TGGTGGCCCA GGAGGGCCCC ACAGCCTTCT ACAAGGGATT TACACCCTCC 
TXTTTGCGTT TGGGATCCTG GAACGTGGTG ATGTTCGTAA CCTATGAGCA GCTGAAACGG 
GCCCTGATGA AAGTCCAGAT GTTACGGGAA TCACCGTTTT GAACAAGACA AGAAGGCCAC 
TGGTAGCTAA CGTGTCCGAA ACCAGTTAAG AATGGAAGAA AACGGTGCAT CCACGCACAC 
ATGGACACAG ACCCACACAT 
J2) INFORMATION FOR SEQ XD NO:-2-: 

(i) SEQUENCE CHARACTERISTICS; 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1220 
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(A) LENGTH: 1034 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : u i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



GAGGGGCCAT 


CCAATCCCTG 


CTGCCACCTC 


CTGGGATGGA 


GCCCTAGGGA 


GCCCCTGTGC 


60 


TGCCCCTGCC 


GTGGCAGGAC 


TCACAGCCCC 


ACCGCTGCCT 


GAAGCCCAGG 


GCTGTGGAGC 


120 


AGCCTCTCTC 


CTTGGACCTC 


CTCTCGGCCC 


TAAAGGGACT 


GGGCAGAGCC 


TTCCAGGACT 


180 


ATGGTTGGAC 


TGAAG C CTT L. 






CTGTGAAGTT 


C CTGGGGG C A 

^^^^ JL v w WW W W*l 


240 


GGCACAGCAG 


CCTGTTTTGC 


TGAACTCGTT 


ACCTTTCCAC 


TGGACACAGC 


CAAGGTCCGC 
• 


300 


CTGCAGATCC 


AGGGGGAGAA 


CCAGGCGGTC 


CAGACGGCCC 


GGCTCGTGCA 


GTACCGTGGC 


360 


GTGCTGGGCA 


CCATCCTGAC 


CATGGTGCGG 


ACTGAGGGTC 


CCTGCAGCCC 


CTACAATGGG 


420 


CTGGTGGCCG 


GCCTGCAGCG 


CCAGATGAGC 


TTCGCCTCCA 


TCCGCATCGG 


CCTCTATGAC 


480 


TCCGTCAAGC 


AGGTGTACAC 


CCCCAAAGGC 


GCGGACAACT 


TCCAGCCTCA 


CTACCCGGAT 


540 


TTTGGCCGGC 


TGCACCACAG 


GAGCCATGGC 


GGTGACCTGT 


GCCCAGCCCA 


CAGATGTGGT 


600 


GAAGGTC CGA 


TTTCAGGCCA 


GCATACACCT 


CGGGCCATCC 


AGGACCGACA 


GAAAATACAG 


660 


CGGGACTATG 


GACGCCTACA 


GAACCATCGC 


CAGGGAGGAA 


GGAGTCAGGG 


GCCTGTGGAA 


720 


AGGAACTTTG 


CCCAACATCA 


TGAGGAATGC 


TATCGTCAAC 


TGTGCTGAGG 


TGGTGACCTA 


780 


CGACATCCTC 


AAGGAGAAGC 


TGCTGGACTA 


CCACCTGCTC 


ACTGACAACT 


TCCCCTGCCA 


840 


CTTTGTCTCT 


GCCTTTGGAG 


CCGGCTTCTG 


TGCCACAGTG 


GTGGCCTCCC 


CGGTGGACGT 


900 


GGTGAAGACC 


CGGTATATGA 


ACTCACCTCC 


AGGCCAGTAC 


TTCAGCCCCC 


TCGACTGTAT 


960 


GATAAAGATG 


GTGGCCCAGG 


AGGGCCCCAC 


AGCCTTCTAC 


AAGGGGTGAG 


CCTCCTCCTG 


1020 


CCTCCAGCAC 


TCCC 










1034 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNES S : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
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Met va! 01* — W. "» «« Vsl 5" *" Iht "* S 1 

5 

Phe Leu Gly Ala Oly Thr Ala Ala Cys Phe Ala Glu Leu Val Thr Phe 



Pro Leu Asp Thr Ala Lys Val Arg Leu Gin He Gin Gly Glu Asn Gin 
Ala Val Gin Thr Ala Arg Leu Val Gin Tyr Arg Gly Val Leu Gly Thr 



55 

50 M 



He Leu Thr Met Val Arg Thr Glu Gly Pro Cys Ser Pro Tyr Asn Gly 

l Val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser lie Arg He 

85 

Gly Leu Tyr Asp Ser Val Lys Gin Val Tyr Thr Pro Lys Gly Ala Asp 

100 10:3 
Asn Ser Ser Leu Thr Thr Arg lie Leu Ala Gly Cys Thr Thr Gly Ala 

115 120 
Met Ala Val Thr Cys Ala Gin Pro Thr Asp Val Val Lys Val' Arg Phe 

130 135 
Gin Ala Ser lie His Leu Gly Pro Ser Arg Ser Asp Arg Lys Tyr Ser 



145 



150 



m aw. Thr- Tie Ala Arq Glu Glu Gly Val Arg 
Gly Thr Met Asp Ala Tyr Arg Thr lie Ala Arg ^ 

165 



Gly Leu Trp Lys Gly Thr Leu Pro Asn lie Met Arg Asn Ala lie Val 

180 185 



^ Cys Ala Glu Val Val Thr Tyr Asp He Leu Lys Glu Lys Leu Leu 



195 200 



ASP Tyr His Leu Leu Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala 



210 215 



Phe Gly Ala Gly Phe Cys Ala Thr Val Val Ala Ser Pro Val Asp Val 



225 



230 



Val Lys Thr Arg Tyr Met Asn Ser Pro Pro Gly Gin Tyr Phe Ser Pro 



245 250 



Leu Asp Cys Met lie Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe 



260 265 



Tyr Lys Gly Phe Thr Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val 



275 280 
Val Met Phe Val Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val 
290 295 



Gin Met Leu Arg Glu Ser Pro Phe 
305 310 

(2) INFORMATION FOR SEQ ID NO: 4: 



* 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
{ D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val Gly Leu Lys Pro Ser Asp Val Pro Pro Thr Met Ala Val Lys 
r 5 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Glu Leu Val Thr Phe 

20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Gin 
35 40 45 

Ala Val Gin Thr Ala Arg Leu Val Gin Tyr Arg Gly Val LeU Gly Thr 
50 55 60 

He Leu Thr Met Val Arg Thr Glu Gly Pro Cys Ser Pro Tyr Asn Gly 
65 70 75 80 

Leu Val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser He Arg He 

05 90 95 

Gly Leu Tyr Asp Ser Val Lys Gin Val Tyr Thr Pro Lys Gly Ala Asp 

100 105 HO 

Asn Ser Ser Leu Thr Thr Arg He Leu Ala Gly Cys Thr Thr Gly Ala 
115 120 125 

Met Ala Val Thr Cys Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe 
130 135 140 

Gin Ala Ser He His Leu Gly Pro Ser Arg Ser Asp Arg Lys Tyr Ser 
145 150 155 160 

Gly Thr Met Asp Ala Tyr Arg Thr He Ala Arg Glu Glu Gly Val Arg 

165 170 175 

Gly Leu Trp Lys Gly Thr Leu Pro Asn He Met Arg Asn Ala He, Val 

180 185 190 

Asn Cys Ala Glu Val Val Thr Tyr Asp He Leu Lys Glu Lys Leu Leu 
195 200 205 

Asp Tyr His Leu Leu Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala 
210 215 220 

Phe Gly Ala Gly Phe Cys Ala Thr Val Val Ala Ser Pro Val Asp Val 
225 230 235 240 

Val Lys Thr Arg Tyr Met Asn Ser Pro Pro Gly Gin Tyr Phe Ser Pro 

245 250 255 
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Leu Asp Cy S Met He Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe 



260 



Tyr Lys Gly 
275 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
i 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Gly Gly Leu Thr Ala Ser Asp Val His Pro Thr Leu Gly Val Gin 

C 10 -k? 

Phe Ser Ala Gly He Ala Ala Cys Leu Ala Asp Val lie Thr Phe 



1 

Leu 
Pro 



Leu Asp Thr Ala Lys Val Arg Leu Gin Val Gin Gly Glu Cys Pro 
35 40 45 

Thr Ser Ser Val lie Arg Tyr Lys Gly Val Leu Gly Thr lie Thr Ala 

50 55 
Val Val Lys Thr Glu Gly Arg Met Lys Leu Tyr Ser Gly Leu Pro Ala 
65 70 75 

Gly Leu Gin Arg Gin lie Ser Ser Ala Ser Leu Arg He Gly Leu Tyr 

85 ,90 

Asp Thr Val Gin Glu Phe Leu Thr Ala Gly Lys Glu Thr Ala Pro Ser 

Leu Gly Ser Lys lie Leu Ala Gly Leu Thr Thr Gly Gly Val Ala Val 

115 120 

Phe He Gly Gin Pro Thr Glu Val Val Lys Val Arg Leu Gin Ala Gin 
130 135 140 

Ser His Leu His Gly lie Lys Pro Arg Tyr Thr Gly Thr Tyr Asn Ala 
145 150 155 

Tyr Arg He He Ala Thr Thr Glu Gly Leu Thr Gly Leu Trp Lys Gly 

Thr Thr Pro Asn Leu Met Arg Ser Val He He Asn Cys Thr Glu "Leu 

180 185 iyu • 

Val Thr Tyr Asp Leu Met Lys Glu Ala Phe Val Lys Asn Asn He Leu 
195 200 205 

Ala Acp'Asp Val Pro Cys His Leu Val Ser Ala Leu lie Ala Gly Phe 
210 215 220 
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Cys Ala Thr Ala Met Ser Ser Pro Val Asp Val Val Lys T.hr Arg Phe 
225 230 235 240 

lie Asn Ser Pro Pro Gly Gin Tyr Lys Ser Val Pro Asn Cys Ala Met 

245 250 255 

Lys Val Phe Thr Asn Glu Gly Pro Thr Ala Phe Phe Lys Gly Leu Val 

260 265 270 

Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val lie Met Phe Val Cys 
275 280 285 

Phe Glu Gin Leu Lys Arg Glu Leu Ser Lys Ser Arg Gin Thr Met Asp 
290 295 300 

Cys Ala Thr 
305 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Val Gly Phe Lys Ala Thr Asp Val Pro Pro Thr Ala Thr Val Lys 
1 5 10 15 

Leu Phe Gly Ala Gly Thr Ala Ala Cys lie Ala Asp Leu lie Thr Phe 

20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Ser Gin 
35 40 45 

Gly Pro Val Arg Ala Thr Val Ser Ala Gin Tyr Arg Gly Val Met Gly 
50 55 60 

Thr lie Leu Thr Met Val Arg Thr Glu Gly Pro Arg Ser Leu Tyr Asn 
65 70 75 80 

Cys Leu Val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser Val Arg 

85 90 95 

lie Gly Leu Tyr Asp Ser Val Lys Gin Phe Tyr Thr Lys Gly Ser Glu 

100 105 110 

His Ala Ser lie Gly Ser Arg Leu Leu Ala Gly Ser Thr Thr Gly Ala 
115 120 125 

* Leu Ala Val Ala Val Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe 
130 135 140 

Gin Ala. Gin Arg Ala Gly Gly Gly Arg Arg Tyr Gin Ser Thr Val Asn 
145 150 155 160 
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Ala Tyr Lys Thx lie Ala Arg Glu Glu Gly Phe Arg Gly Leu Trp Lys 

165 

Gly Thr Ser Pro Asn Val Ala Arg Asa Ala lie Val Asn Cys Ala Glu 

180 

Leu Val Thr Tyr Asp Leu He Lys Asp Ala Leu Leu Lys Ala Asn Leu 

195 200 
Met Thr Asp Asp Leu Pro Cys His Phe Thr Ser Ala Phe Gly Ala Gly 



215 



Phe Cys Thr Thr Val He Ala Ser Pro Val Asp Val Val Lys Thr Arg 

225 230 

Tyr Met Asn Ser Ala Leu Gly Gin Tyr Ser Ser Ala Gly His Cys Ala 

245 

Leu ~* " 



Thr Met Leu Gin Lys Glu Gly Pro Arg Ala Phe Tyr Lys Gly Phe 
260 265 

Met Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val Val Met Phe Val 

275 280 
Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Ala Ala Cys Thr Ser Arg 

290 2 ^5 

Glu Ala Pro Phe 
305 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = M DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
GAGACAACAG TGAATGGTGA GGCCCGGCCG TCAGATCCTG CTGCTACCTA ATGGAGTGGA 
GCCTTAGGGT GGCCCTGCAC TACCCAACCT TGGCTAGACG CACAGCTTCC TCCCTGAACT 
GAAGCAAAAG ATTGCCAGGC AAGCTCTCTC CTCGGACCTC CATAGGCAGC AAAGGAACCA 
GGCCCATTCC CCGGGACCAT GGTTGGACTT CAGCCCTCCG AAGTGCCTCC CACAACGGTT 
GTGAAGTTCC TGGGGGCCGG CACTGCGGCC TGTTTTGCGG ACCTCCTCAC TTTTCCCCTG 
GACACCGCCA AGGTCCGTCT GCAGATCCAA GGGGAGAACC CAGGGGCTCA GAGCGTGCAG 
TACCGCGGTG TGCTGGGTAC CATCCTGACT ATGGTGCGCA CAGAGGGTCC CCGCAGCCCC 
TACAGCGGAC TGGTCGCTGG CCTGCACCGC CAGATGAGTT TTGCCTCCAT TCGAATTGGC 
CTCTACGACT CTGTCAAGCA GTTCTACACC CCCAAGGGAG CGGACCACTC CAGCGTCGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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ATCAGGATTC TGGCAGGCTG CACGACAGGA GCCATGGCAG TGACCTGCGC CCAGCCCACG 600 

GATGTGGTCA AGGTCCGATT TCAAGCCATG ATACGCCTGG GAACTGGAGG AGAGAGGAAA 660 

TACAGAGGGA CTATGGATGC CTACAGAACC ATCGCCAGGG AGGAAGGAGT CAGGGGCCTG 72 0 

TGGAAAGGGA CTTGGCCCAA CATCACAAGA AATGCCATTG TCAACTGTGC TGAGATGGTG 780 

ACCTACGACA TCATCAAGGA GAAGTTGCTG GAGTCTCACC TGTTTACTGA CAACTTCCCC 840 

TGTCACTTTG TCTCTGCCTT TGGAGCTGGC TTCTGTGCCA CAGTGGTGGC CTCCCCGGTC 90 0 

GATGTGGTAA AGACCCGATA CATGAACGCT CCCCTAGGCA GGTAC CGC AG CCCTCTGCAC 960 

TGTATGCTGA AGATGGTGGC TCACGAGGGA CCCACGGCCT TCTACAAAGG ATTTGTGCCC 1020 

TCCTTTCTGC GTCTGGGAGC TTGGAACGTG ATGATGTTTG TAACATATCA GCAACTGAAG 1080 

AGGGCCTTAA TGAAAGTCCA GGTACTGCGG GAATCTCCGT TTTGAACAAG GCAAGCAGGC 1140 

TGCCTGGAAC AGAACAAAGC GTCTCTGCCT GGGACACAGG CCCACACGTC AGAACCGTGC 12 00 

ACGC 



1204 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 8 amino acids 

(B) TYPE: amino acid- 
ic) STRANDEDNESS : unknown 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Val Gly Leu Gin Pro Ser Glu Val Pro Pro Thr Thr Val Val Lys 
! 5 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Asp Leu Leu Thr Phe 

20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin He Gin Gly Glu Asn Pro 
35 40 45 

Gly Ala Gin Ser Val Gin Tyr Arg Gly Val Leu Gly Thr He Leu Thr 
50 55 60 

Met Val Arg Thr Glu Gly Pro Arg Ser Pro Tyr Ser Gly Leu Val Ala 
65 70 75 80 

" " Gly Leu His Arg Gin Met Ser Phe Ala Ser He Arg He Gly Leu Tyr 

85 90. 95 

Asp Ser Val Lys Gin Phe Tyr Thr Pro Lys Gly Ala Asp His Ser Ser 

100 105 HO 

Val Ala He Arg He Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val 
115 120 125 



195 



200 



Ph e Thr MP Asn Phe Pro Cys His Phe Val Ser Ala Phe Gly Ala Gly 

210 " s 

Val Val Ala Ser Pro Val Asp Val Val Lys Thr Arg 



Phe Cys Ala Thr 



225 



230 



„r Met Asn Ala Pro Leu Gly Arg Tyr Arg Ser Pro Leu His Cys Met 

Val Ala Gin Glu Gly Pro Thr Ala Phe Tyr Lys Gly Phe 

265 " w 



Leu Lys Met 



Thr Tyr 

290 



Arg Leu Gly Ala Trp Asn Val Met Met Phe Val 
280 285 

Glu Gin Leu Lys Arg Ala Leu Met Lys Val Gin Val Leu Arg 

295 JUU 



Val Pro Ser Phe Leu 
275 



Glu Ser Pro Phe 
305 



INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : _ 

(A) LENGTH: 307 amino aci< 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Me t Val Asn Pro Thr Thr Ser Glu Val Gin Pro Thr Met Gly Val Lys 

1 5 1 

lie Phe ser Ala Gly Val Ser Ala Cys Leu Ala Asp lie lie Thr Phe 

20 25 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin 0* <ta Gly Glu Gly O. 
35 40 
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Ala Ser Ser Thr lie Arg Tyr Lys Gly Val Leu Gly Thr lie Thr Thr 
50 55 60 

Leu Ala Lys Thr Glu Gly Leu Pro Lys Leu Tyr Ser Gly Leu Pro Ala 
65 70 75 80 

Glv He Gin Arg Gin He Ser Phe Ala Ser Leu Arg He Gly Leu Tyr 
2 85 90 95 

Asp Ser Val Gin Glu Tyr Phe Ser Ser Gly Arg Glu Thr Pro Ala Ser 

100 105 HO 

Leu Gly Asn Lys He Ser Ala Gly Leu Met Thr Gly Gly Val Ala Val 
115 120 125 

Phe He Gly Gin Pro Thr Glu Val Val Lys Val Arg Met Gin Ala Gin 
130 135 140 

Ser His Leu His Gly He Lys Pro Arg Tyr Thr Gly Thr Tyr Asn Ala 
145 150 155 160 

Tvr Arq Val He Ala Thr Thr Glu Ser Leu Ser Thr Leu Trp Lys Gly 
y 165 170 175 

Thr Thr Pro Asn Leu Met Arg Asn Val He He Asn Cys Thr Glu Leu 

180 185 190 

Val Thr Tyr Asp Leu Met Lys Gly Ala Leu Val Asn Asn Lys He Leu 
195 200 205 

Ala Asp Asp Val Pro Cys His Leu Leu Ser Ala Leu Val Ala Gly Phe 
210 215 220 

Cvs Thr Thr Leu Leu Ala Ser Pro Val Asp Val Val Lys Thr Arg Phe 
225 230 235 240 

He Asn Ser Leu Pro Gly Gin Tyr Pro Ser Val Pro Ser Cys Ala Met 

245 250 255 

Ser Met Tyr Thr Lys Glu Gly Pro Thr Ala Phe Phe Lys Gly Phe Val 

260 265 270 

Ala Ser Phe Leu Arg Leu Gly Ser Trp Asn Val He Met Phe Val Cys 
275 200 285 

Phe Glu Gin Leu Lys Lys Glu Leu Met Lys Ser Arg Gin Thr Val Asp 
290 295 300 

Cys Thr Thr 
305 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: linear 

i±±) MOLECULE TYPE: protein 



WO 98/45438 



-56- 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Val Gly Phe Lys Ala Thr Asp Val Pro Pro Thr Ala Thr Val Lys 

Phe Leu Gly Ala Gly Thr Ala Ala Cys lie Ala Asp Leu lie Thr Phe 

20 

o t , A«,n Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Ser Gin 

Pro Leu Asp Tnr mci ^ 45 



Gly Leu Val Arg Thr Ala Ala Ser Ala Gin Tyr Arg Gly Val Leu Gly 

Thr Met Val Arg Thr Glu Gly Pro Arg Ser Leu Tyr Asn 

r-t r\ 7b 



35 

1 Arg Thr Ala ; 

55 

Thr He Leu 

ly Leu val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser Val Arg 

85 

II. Gly Leu Tyr Asp Ser Val Lys Gin Phe Tyr Thr Lys Gly Ser Glu 

100 xvo 

, r*~\ Ti* riv Ser Arg Leu Leu Ala Gly Ser Thr Thr Gly Ala 
His Ala Gly He Gly ser Arg xie 12g 

115 12U 
L eu Ala Val Ala Val Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe 

Gin Ala Gin Ala Arg Ala Gly Gly Gly Arg Arg Tyr Gin Ser Thr Val 
145 150 

Glu Ala Tyr Lys Thr He Ala Arg Glu Glu Gly He Arg Gly Leu Trp 

Lys oiy Thr Ser Pro Asn Val Ala Arg Asn Ala lie Val Asn Cys Ala 

180 105 

Glu Leu Val Thr Tyr Asp Leu lie Lys Asp Thr Leu Leu Lys Ala Asn 



195 

Leu Met Thr Asp Asp Leu Pro Cys His Phe Thr Ser Ala Phe Gly Ala 



210 215 



Gly Phe Cys Thr Thr Val lie Ala Ser Pro Val Asp Val Val Lys Thr 

225 230 

Arg Tyr Met Asn Ser Ala Leu Gly Gin Tyr His Ser Ala Gly Hi. Cys 

3 245 2b0 

Ala Leu Thr Met He Arg Lys Glu Gly Pro Arg Ala Phe Tyr Lys Gly 

260 265 

Phe Met Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val Val Met Phe 

275 280 ' 

Val Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Ala Ala Tyr Gin Ser 

290 295 

Axg GIxl Ala Pro Phe 
305 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGACTCACAG GTAAGACCCC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCTCCTGCAG CCCCACCGCT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCGCCTGCAG GTAGGTGCCC 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ala Thr Cys Cys Ala Gly 

1 5 

Gly Gly Gly Gly 

20 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

i (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

t±±) MOLECULE TYPE: other nucleic acid 
( (A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGCGCGGACA GTGAGTGACC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

20 

CCCCTCCCAG ACTCCAGCCT 
(2) INFORMATION FOR SEQ ID NO: 1*7: 

* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc * "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

20 

CTGTGGAAAG GTAGGTCTGG 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

( c ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Ala Ala Cys Thr Thr 
! 5 io 15 

Thr Gly Cys Cys 

20 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
CTGCTCACTG GTGAGGCCCT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
TCCTCTGCAG ACAACTTCCC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TCTACAAGGG GTGAGCCTCC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TTCTTATCAG ATTTACACCC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

< C ) STRANDEDNESS : s ingle 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
GGACTACCAC CTGCTCACTG 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /dcoc « "DNA" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CCCGTAACAT ATGGACTTT 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

TTCACCACGT CCACCCGGGG GGATGCCACC 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
AATAAA 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Arg Arg Gly His Pro lie Pro Ala Ala Thr Ser Trp Asp Gly Ala Leu 
15 10 15 

Gly Ser Pro Cys Ala Ala Pro Ala Val Ala Gly lie Thr Ala Pro Pro 

20 25 30 

Leu His Ser Pro Gly Leu Trp Ser Ser Leu Ser Pro Trp Thr Sex Sex 
35 40 45 
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Arg Pro Arg Asp Trp Ala Glu Pro Ser Arg Thr Met Val Gly Leu Lys 



50 55 



Pro Ser Asp Val Pro Pro Thr Met Ala Val Lys Phe Leu Gly Ala Gly 
65 70 75 

lie Ala Ala Cys Phe Ala Glu Leu Val Thr Phe Pro Leu Asp Thr Ala 

85 90 

Lys val Arg Leu Gin lie Gin Gly Glu Asn Gin Ala Val Gin Thr Ala 



Arg Leu Val Gin Tyr Arg Gly Val Leu Gly Thr lie Leu Thr Met Val 

H5 120 j.« 

Arg (Thr Glu Gly Pro Cys Ser Pro Tyr Asn Gly Leu Val Ala Gly Leu 

130 135 
Gin Arg Gin Met Ser Phe Ala Ser He Arg lie Gly Leu Tyr Asp Ser 
145 150 155 

Val Lys Gin Val Tyr Thr Pro Lys Gly Ala Asp Asn Ser Ser Leu Thr 

165 170 

Thr Arg lie Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val' Thr Cys 

180 185 190 

Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe Gin Ala Ser lie His 
195 200 205 

Leu Gly Pro Ser Arg Ser Asp Arg Lys Tyr Ser Gly Thr Met Asp Ala 

210 215 
Tyr Arg Thr lie Ala Arg Glu Glu Gly Val Arg Gly Leu Trp Lys Gly 
225 230 

Thr Leu Pro Asn lie Met Arg Asn Ala He Val Asn Cys Ala Glu Val 



245 250 

Val Thr Tyr Asp He Leu Lys Glu Lys Leu Leu Asp Tyr His Leu Leu 

260 265 2/0 



Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala Phe Gly Ala Gly Phe 
275 280 285 

Cys Ala Thr Val Val Ala Ser Pro Val Asp Val Val Lys Thr Arg Tyr 
Y 290 295 300 

Met Asn Ser Pro Pro- Gly Gl-n Tyr Phe Ser Pro Leu Asp Cys Met lie 
305 3 10 315 320 

Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe Tyr Lys Gly Phe Thr 
* 325 330 335 

Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val Val Met Phe V,al Thr 

340 ' 345 350 

Tvr Glu Gin Leu Lys Arg Ala Leu Met Lys Val Gin Met Leu Arg Glu 
355 3 60 365 

Ser Pro Phe Tyr Arg Gin Glu Gly Hio Trp Leu Thr Cys Pro Lys Pro 
370 375 580 
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Val Lys Asn Gly Arg Lys Arg Cys lie His Ala His Met *Asp Thr Asp 
385 390 395 400 

Pro His lie 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
i (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Gly Gly Ala lie Gin Ser Leu Leu Pro Pro Pro Gly Met Glu Pro Gly 
15 10 15 

Ala Pro Val Leu Pro Leu Pro Trp Gin Asp Ser Gin Pro His Arg Cys 

20 25 30 

Thr Glu Ala Gin Gly Cys Gly Ala Ala Ser Leu Leu Gly Pro Pro Leu 
35 40 45 

Gly Pro Lys Gly Thr Gly Gin Ser Leu Pro Gly Leu Trp Leu Asp Ser 
50 55 60 

Leu Gin Thr Cys Leu Pro Pro Trp Leu Ser Ser Trp Gly Gin Ala Gin 
65 70 75 80 

Gin Pro Val Leu Leu Asn Ser Leu Pro Phe His Trp Thr Gin Pro Arg 

85 90 95 

Ser Ala Cys Arg Ser Arg Gly Arg Thr Arg Arg Ser Arg Arg Pro Gly 

100 105 HO 

Ser Cys Ser Thr Val Ala Cys Trp Ala Pro Ser Pro Trp Cys Gly Leu 
115 120 125 

Arg Val Pro Ala Ala Pro Thr Met Gly Trp Trp Pro Ala Cys Ser Ala 
130 135 140 

Arg Ala Ser Pro Pro Ser Ala Ser Ala Ser Met Thr Pro Ser Ser Arg 
145 150 155 160 

Cys Thr Pro Pro Lys Ala Arg Thr Thr Pro Ala Ser Leu Pro Gly Phe 

165 170 1,75 

Trp Pro Ala Ala Pro Gin Glu Pro Trp Arg Pro Val Pro Ser Pro Gin 

180 185 190 

Met Trp Arg Ser Asp Phe Arg Pro Ala Tyr Thr Ser Gly His Pro Gly 
195 200 205 
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Ala Thr Glu Asn Thr Ala Gly Leu Trp Thr Pro Thr Glu Pro Ser Pro 



225 



210 

Gly Arg Lys Glu Ser Gly Ala Cys Gly Lys Glu Leu Cys Pro Thr Ser 



215 



Gly Met Leu Ser Ser Thr Val Leu Arg Trp Pro Thr Thr Ser Ser Arg 

2 5 

Arg Ser Cys Trp Thr Thr Thr Cys Ser Leu Thr Thr Ser Pro Ala Thr 

260 265 

Leu Ser Leu Pro Leu Glu Pro Ala Ser Val Pro Gin Trp. Trp Pro Pro 

275 280 
Arg Trp Thr Trp Arg Pro Gly lie Thr His Leu Gin Ala Ser Thr Ser 



290 



295 



Ala Pro Ser Thr Val Arg Trp Trp Pro Arg Arg Ala Pro Gin Pro Ser 

305 ■ 3iU 

Thr Arg Asp Leu His Pro Pro Phe Cys Val Trp Asp Pro Gly Thr Trp 

325 

Cys Ser Pro Met Ser Ser Asn Gly Pro Lys Ser Arg Cys Tyr Gly Asn 
* 340 345 Jb0 

His Arg Phe Glu Gin Asp Lys Lys Ala Thr Gly Ser Arg Val Arg Asn 

355 360 
Gin Leu Arg Met Glu Glu Asn Gly Ala Ser Thr His Thr Trp Thr Gin 

370 375 

Thr His Thr 
385 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : ^ 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Glu Gly Pro Ser Asn Pro Cys Cys His Leu Leu Gly Trp Ser Pro Arg 
1 5 10 

Glu Pro Leu Cys Cys Pro Cys Arg Gly Arg Thr His Ser Pro Thr Ala 

20 25 30 

Ala Leu Lys Pro Arg Ala Val Glu Gin Pro Leu Ser Leu Asp Leu Leu 
35 40 45 

Ser Ala leu Lys Gly Leu Gly Arg Ala Phe Gin Gin Tyr Gly Trp Thr 
50 55 60 
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Glu Ala Phe Arg Arg Ala Ser His His Gly Cys Glu Val Pro Gly Gly 
65 70 75 80 

Arg His Ser Ser Leu Phe Cys Thr Arg Tyr Leu Ser Thr Gly His Ser 

85 90 95 

Gin Gly Pro Pro Ala Asp Pro Gly Gly Glu Pro Gly Gly Pro Gin Gly 

100 105 110 

Pro Ala Arg Ala Val Pro Trp Arg Ala Gly His His Pro Asp His Gly 
115 120 125 

Ala Asp Gly Ser Leu Gin Pro Leu Gin Trp Ala Gly Gly Arg Pro Ala 
130 135 140 

Ala (Pro Asp Glu Leu Arg Leu His Pro His Arg Pro Leu Leu Arg Gin 
145 150 155 160 

Ala Gly Val His Pro Gin Arg Arg Gly Gin Leu Gin Pro His Tyr Pro 

165 170 175 

Asp Phe Gly Arg Leu His His Arg Ser His Gly Gly Asp Leu Cys Pro 

180 185 190: • 

Ala His Arg Cys Gly Glu Gly Pro lie Ser Gly Gin His Thr Pro Arg 
195 200 205 

Ala lie Gin Glu Arg Gin Lys lie Gin Arg Asp Tyr Gly Arg Leu Gin 
210 215 220 

Asn His Arg Gin Gly Gly Arg Ser Gin Gly Pro Tyr Glu Arg Asn Phe 
225 230 235 240 

Ala Gin His His Glu Glu Cys Tyr Arg Gin Leu Gly Gly Gly Asp Leu 

245 250 255 

Arg His Pro Gin Gly Glu Ala Ala Gly Leu Pro Pro Ala His Gin Leu 

260 265 270 

Pro Leu Pro Leu Cys Leu Cys Leu Trp Ser Arg Leu Leu Cys His Ser 
275 280 285 

Gly Gly Leu Pro Gly Gly Arg Gly Glu Asp Pro Val Tyr Glu Leu Thr 
290 295 300 

Ser Arg Pro Val Leu Gin Pro Pro Arg Leu Tyr Asp Lys Asp Gly Gly 
305 310 315 320 

Pro Gly Gly Pro His Ser Leu Leu Gin Gly lie Tyr Thr Leu Leu Phe 

325 330 335 

Ala Phe Gly lie Leu Glu Arg Gly Asp Val Arg Asn Leu Ala Ala Glu 

340 345 350 

Thr Gly Pro Asp Glu Ser Pro Asp Val Thr Gly lie Thr Val ^eu Asn 
355 360 365 

Lys Thr Arg Arg Pro Leu Val Ala Lys Val Ser Glu Thr Ser Glu Trp 
370 375 380 

Lys Lys Thr Val His Pro Arg Thr His Gly His Arg Pro Thr His 
385 390 395 



• 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 339 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Arg Gly His Pro He Pro Ala Ala Thr Ser Trp Asp Gly Ala Leu Gly 

5 10 " 

Pro Cys Ala Ala Pro Ala Val Ala Gly Leu Thr Ala Pro Pro Leu 



Ser 



25 30 



ser Pro Gly Leu Trp Ser Ser Leu Ser Pro Trp Thr Ser Ser Arg Pro 

35 40 45 , _ 

Aro Asp Trp Ala Glu Pro Ser Arg Thr Met Val Gly Leu Lys Pro Ser 

50 55 60 

Asp Val Pro Pro Thr Met Ala Val Lys Phe Leu Gly Ala Gly Thr Ala 
65 70 75 

Ala Cys Phe Ala Glu Leu Val Thr Phe Pro Leu Asp Thr Ala Lys Val 
y a5 90 95 

Arg Leu Gin lie Gin Gly Glu Asn Gin Ala Val Gin Thr Ala Arg Leu 
3 100 105 110 

Val Gin Tyr Arg Gly Val Leu Gly Thr He Leu Thr Met Val Arg Thr 

115 120 125 

Glu Gly Pro Cys Ser Pro Tyr Asn Gly Leu Val Ala Gly Leu Gin Arg 
130 135 1 

Gin Met Ser Phe Ala Ser He Arg He Gly Leu Tyr Asp Ser Val Lys 
145 150 1= 5 

Gin Val Tyr Thr Pro Lys Gly Ala Asp Asn Ser Ser Leu Thr Thr Arg 

6 *^ ^ 

He Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val Thr Cys Ala Gin 

180 185 190 

Pro Thr Asp Val Val Lys Val Arg Phe Gin Ala Ser He His Leu Gly 
19 5 200 205 

pro Ser Arg Ser Asp Arg Lys Tyr Ser Gly Thr Met Asp Ala Tyr Arg 
210 215 220 

Thr He Ala Arg Phe Glu Gly Val Arg Gly Leu Trp Lys Gly Thr Leu 
225 230 235 240 

Pro Asri lie Met Arg Asn Ala He Val Asn Cys Ala Glu Val Val Thr 

245 250 255 
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Tyr Asp lie Leu Lys 

260 

Phe Pro Cys His Phe 
275 

Val Val Ala Ser Pro 
290 

Pro Pro Gly Gin Tyr 
305 

Ala Gin Glu Gly Pro 

325 

Gin His Ser 



Glu Lys Leu Asp Tyr His 

265 

Val Ser Ala Phe Gly Ala 
280 

Val Asp Val Val Lys Thr 
295 

Phe Ser Pro Leu Asp Cys 
310 315 

Thr Ala Phe Tyr Lys Gly 

330 



Leu Leu Thr Asp Asn 

270 

Gly Phe Cys Ala Thr 
285 

Arg Tyr Met Asn Ser 
300 

Met lie Lys Met Val 

320 

-t 

Ala Ser Ser Cys Leu 

335 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 31 amino acids 

(B) TYPE: amino acid 

( C ) S TRANDEDNE S S : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

Gly Ala lie Gin Ser Leu Leu Pro Pro Pro Gly Met Glu Pro Gly Ala 
15 10 15 

Pro Val Leu Pro Leu Pro Trp Gin Asp Ser Gin Pro His Arg Cys lie 

20 25 30 

Glu Ala Gin Gly Cys Gly Ala Ala Ser Leu Leu Gly Pro Pro Leu Gly 
35 40 45 

Pro Lys Gly Thr Gly Gin Ser Leu Pro Gly Leu Trp Leu Asp Ser Leu 
50 55 60 

Gin Thr Cys Leu Pro Pro Trp Leu Ser Ser Trp Gly Gin Ala Gin Gin 
65 70 75 80 

Pro Val Leu Leu Asn Ser Leu Pro Phe His Trp Thr Gin Pro Arg Ser 

85 90 95 

Ala Cys Arg Ser Arg Gly Arg lie Arg Arg Ser Arg Arg Pro Gly Ser 

100 105 110 

Cys Ser Thr Val Ala Cys Trp Ala Pro Ser Pro Trp Cys Gly Leu Arg 
115 120 125 

Val Pro Ala Ala Pro Thr Met Gly Trp Trp Pro Ala Cys Ser Ala Arg 
130 135 140 

Ala Ser Pro Pro Ser Ala Ser Ala Ser Met Thr Pro Ser Sex Arg Cys 
145 150 155 160 
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Thr Pro Pro Lys Ala Arg Thr Thr Pro Ala Ser Leu Pro Gly Phe Trp 

3L 6 5 

Pro Ala Ala Pro Gin Glu Pro Trp Arg Pro Val Pro Ser Pro Gin Met 

180 185 

Trp Arg Ser Asp Phe Arg Pro Ala Tyr Thr Ser Gly His Pro Gly Ala 

195 

Thr Glu Asn Thr Ala Gly Leu Trp Thr Pro Thr Glu Pro Ser Pro Gly 

210 215 
Arg Lys Glu Ser Gly Ala Cys Gly Lys Glu Leu Cys Pro Thr Ser Gly 
225 230 

Met Leu Ser Ser Thr Val Leu Arg Trp Pro Thr Thr Ser Ser Arg Arg 

245 

Ser Cys Trp Thr Thr Thr Cys Ser Leu Thr Thr Ser Pro Ala Thr Leu 

260 265 

Ser Leu Pro Leu Glu Pro Ala Ser Val Pro Gin Trp Trp Pro P?:o Arg 
275 280 285 

Trp lie Trp Arg Pro Gly lie Thr His Leu Gin Ala Ser Thr Ser Ala 

290 295 

Pro Ser Thr Val Arg Trp Trp Pro Arg Arg Ala Pro Gin Pro Ser Thr 
305 310 315 

Arq Gly Glu Pro Pro Pro Ala Ser Ser Thr Pro 

325 330 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

. (A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Glu Gly Pro Ser Asn Pro Cys Cys His Leu Leu Gly Trp Ser Pro Arg 
! 5 1° 

Glu Pro Leu Cys Cys Pro Cys Arg Gly Arg Thr His Ser Pro Thr Ala 

20 25 JU 

Ala Leu Lys Pro Arg Ala Val Glu Gin Pro Leu Ser Leu Asp Leu Leu 

40 " 



35 



Ser Ala- Leu Lys Gly Leu Gly Arg Ala Phe Glu Asp Tyr Gly Trp Thr 
50 55 60 
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Glu Ala Phe Arg Arg Ala Ser His His Gly Cys Glu Val pro Gly Gly 
65 70 75 80 

Arg His Ser Ser Leu Phe Cys Thr Arg Tyr Leu Ser Thr Gly His Ser 

85 90 95 

Gin Gly Pro Pro Ala Asp Pro Gly Gly Glu Pro Gly Gly Pro Asp Gly 

100 105 110 

Pro Ala Arg Ala Val Pro Trp Arg Ala Gly His His Pro Asp His Gly 
115 120 125 

Ala Asp Gly Ser Leu Gin Pro Leu Gin Trp Ala Gly Gly Arg Pro Ala 
130 135 140 

Ala Pro Asp Glu Leu Arg Leu His Pro His Arg Pro Leu Leu Arg Gin 
145 150 155 160 

Ala Gly Val His Pro Gin Arg Arg Gly Gin Leu Gin Pro His Tyr Pro 

165 170 175 

Asp Phe Gly Arg Leu His His Arg Ser His Gly Gly Asp Leu Cys Pro 

180 185 190 

Ala His Arg *Cys Gly Glu Gly Pro lie Ser Gly Gin His Thr Pro Arg 
195 200 205 

Ala lie Gin Glu Arg Gin Lys lie Gin Arg Asp Tyr Gly Arg Leu Gin 
210 215 220 

Asn His Arg Gin Gly Gly Arg Ser Gin Gly Pro Val Glu Arg Asn Phe 
225 230 235 240 

Ala Gin His His Glu Glu Cys Tyr Arg Gin Leu Cys Gly Gly Asp Leu 

245 250 255 

Arg His Pro Gin Gly Glu Ala Ala Gly Leu Pro Pro Ala His Cys Leu 

260 265 270 

Pro Leu Pro Leu Cys Leu Cys Leu Trp Ser Arg Leu Leu Cys His Ser 
275' 280 285 

Gly Gly Leu Pro Gly Gly Arg Gly Glu Asp Pro Val Tyr Glu Leu Thr 
290 295 300 

Ser Arg Pro Val Leu Gin Pro Pro Arg Leu Tyr Asp Lys Asp Gly Gly 
305 310 315 320 

Pro Gly Gly Pro His Ser Leu Leu Cys Gly Val Ser Leu Leu Leu Pro 

325 330 335 

Pro Ala Leu Pro 

340 



(2) INFORMATION FOR SEQ ID NO: 33: 

, (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 97 amino acids 

(B) TYPE: amino acid 

<C) STRAND EDNESS : unknown 
(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Glu Thr Thr Val Asn Gly Glu Ala Arg Pro Ser Asp Pro Ala Ala Thr 
1 5 10 ib 

Trp Ser Cys Ala He Gly Trp Pro Cys Thr Thr Gin Pro Trp Leu Asp 

20 25 

Ala Gin Leu Pro Pro Thr Glu Ala Lys Asp Cys Gin Ala Ser Ser Leu 
35 40 45 

Leu Gly Pro Pro Ala Ala Lys Glu Pro Gly Pro Phe Pro Gly Thr Met 

50 55 60 

Val Gly Leu Gin Pro Ser Glu Val Pro Pro Thr Thr Val Val Lys Phe 
65 70 

Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Asp Leu Leu Thr Phe Pro 

85 90 *T 

Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Pro Gly 

100 105 H° 

Ala Cys Ser Val Gin Tyr Arg Gly Val Leu Gly Thr lie Leu Thr Met 
115 120 125 

Val Arg Thr Glu Gly Pro Arg Ser Pro Tyr Ser Gly Leu Val Ala Gly 
130 140 

Leu His Arg Gin Met Ser Phe Ala Ser lie Arg lie Gly Leu Tyr Asp 
145 150 1=5 

Ser Val Lys Gin Phe Tyr Thr Pro Lys Gly Ala Asp His Ser Ser Val 

165 170 l /b 

Ala He Arg He Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val Thr 

180 185 I 90 

Cvs Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe Gin Ala Met He 
y 195 200 205 

Ara Leu Gly Thr Gly Gly Glu Arg Lys Tyr Arg Gly Thr Met Asp Ala 
y 210 215 220 

Tyr Arg Thr He Ala Arg Glu Glu Gly Val Arg Gly Leu Trp Lys Gly 

225 230 235 

Thr Trp Pro Asn He Thr Arg Asn Ala He Val Asn Cys Ala Glu Met 

245 250 255 

Val Thr Tyr Asp He He Lys Glu Lys Leu Leu Glu Ser His Leu Phe 

260 265 270 

Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala Phe Gly Ala Gly Phe 
275 280 285. 

Cys Ala Thr Val Val Ala Ser Pro Val Asp Val Val Lys Thr Arg Tyr 
230 295 300 
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Met Asn Ala Pro Leu Gly Arg Tyr Arg Ser Pro Leu His Cys Met Leu 
305 310 315 ' 320 

Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe Tyr Lys Gly Phe Val 

325 330 335 

Pro Ser Phe Leu Arg Leu Gly Ala Trp Asn Val Met Met Phe Val Tlir 

340 345 350 

Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val Gin Val Leu Arg Glu 
355 360 365 

Ser Pro Phe Thr Arg Gin Ala Gly Cys Leu Glu Gin Asn Lys Ala Ser 
370 375 380 

Leu Pro Gly Thr Gin Ala His Thr Ser Glu Pro Cys Thr 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Arg Gin Gin Met Val Arg Pro Gly Arg Gin lie Leu Leu Leu Pro Asn 
15 10 15 

Gly Val Glu Pro Gly Gly Pro Ala Leu Pro Asn Leu Gly Thr His Ser 

20 25 30 

Phe Leu Pro Glu Leu Lys Gin Lys lie Ala Arg Gin Ala Leu Ser Ser 
35 40 45 

Asp Leu His Arg Gin Gin Arg Asn Gin Ala His Ser Pro Gly Pro Trp 
50 55 60 

Leu Asp Phe Ser Pro Pro Lys Cys Leu Pro Gin Arg Leu Ser Ser Trp 
65 70 75 80 

Gly Pro Ala Leu Arg Pro Val Leu Arg Thr Ser Ser Leu Phe Pro Trp 

85 90 95 

Thr Pro Pro Arg Ser Val Cys Arg Ser Lys Gly Arg Thr Gin Gly Leu 

100 105 110 

Arg Ala Cys Ser Thr Ala Val Cys Trp Val Pro Ser Leu Trp Cys Ala 
115 120 125 

Asp Arg Val Pro Ala Ala Pro Thr Ala Asp Trp Ser Leu Ala Cys Thr 
130 135 140 

Ala Arg Val. Leu Pro Pro Phe Glu Leu Ala Ser Thr Thr lie Ser Ser 
145 150 155 160 
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Ser Ser Thr Pro Pro Arg Glu Arg Thr Thr Pro Ala Ser Pro Ser Gly 

165 170 

Phe Trp Gin Ala Ala Arg Gin Glu Pro Trp Gin Pro Ala Pro Ser *ro 

180 185 

Arg Wet Trp Arg Ser Asp Phe Lys Pro Tyr Ala Trp Glu Leu Glu Glu 

Arc Gly Asn Thr Glu Gly Leu Trp Met Pro Thr Glu Pro Ser Pro Gly 

- 210 215 
Arg Lys Glu Ser Gly Ala Cys Gly Lys Gly Leu Gly Pro Thr Ser Gin 
225 230 

Glu Met Pro Leu Ser Thr Val Leu Arg Trp Pro Thr Thr Ser Ser Arg 

245 250 

Arg Ser Cys Trp Ser Leu Thr Cys Leu Leu Thr Thr Ser Pro Val Thr 

260 265 

Leu Ser Leu Pro Leu Glu Leu Ala Ser Val Pro Gin Trp Trp Pro Pro 
275 280 285 

Arg Trp Met Trp Arg Pro Asp Thr Thr Leu Pro Ala Gly Thr Ala Ala 
290 

Leu Cys Thr Val Cys Arg Trp Trp Leu Arg Arg Asp Pro Arg Pro Ser 
305 310 315 

Thr Lys Asp Leu Cys Pro Pro Phe Cys Val Trp Glu Leu Gly Thr Cys 

325 330 335 

Leu His Met Ser Asn Arg Gly Pro Lys Ser Arg Tyr Cys Gly Asn Leu 

340 345 " u 

Arq Phe Glu Gin Gly Lys Gin Ala Ala Trp Asn Arg Thr Lys Arg Leu 
355 360 365 

Cvs Leu Gly His Arg Pro Thr Arg Gin Asn Arg Ala Arg 
370 375 380 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Arg Asp Asn Ser Glu Trp Gly Pro Ala Val Arg Ser Cys Cys Tyr Leu 
1 5 10 15 

Met Glu Trp Ser Leu Arg Val Ala Leu His Tyr Pro Thr Leu Ala Arg 

20 25 30 
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Arg Thr Ala Ser Ser Leu Asn Ser Lys Arg Leu Pro Gly>Lys Leu Ser 
35 40 45 

Pro Arg Thr Ser lie Gly Ser Lys Gly Thr Arg Pro lie Pro Arg Asp 
50 55 60 

His Gly Trp Thr Ser Ala Leu Arg Ser Ala Ser His Asn Gly Cys Glu 
65 70 75 80 

Val Pro Gly Gly Arg His Cys Gly Leu Phe Cys Gly Pro Pro His Phe 

85 90 95 

t 

Ser Pro Gly His Arg Gin Gly Pro Ser Ala Asp Pro Arg Gly Glu Pro 

100 105 110 

Arg Gly Ser Glu Arg Ala Val Pro Arg Cys Ala Gly Tyr His Pro Asp 
115 120 125 

Tyr Gly Ala His Arg Gly Ser Pro Gin Pro Leu Gin Arg Thr Gly Arg 
130 135 140 

Trp Pro Ala Pro Pro Asp Glu Phe Cys Leu Met Ser Asn Trp Pro Leu 
145 150 155 1 160 

Arg Leu Cys Gin Ala Val Leu His Pro Gin Gly Ser Gly Pro Leu Gin 

165 170 175 

Arg Arg His Gin Asp Ser Gly Arg Leu His Asp Arg Ser His Gly Ser 

180 185 190 

Asp Leu Arg Pro Ala His Gly Cys Gly Glu Gly Pro He Ser Ser His 
195 200 205 

Asp Thr Pro Gly Asn Trp Arg Arg Glu Glu He Gin Arg Asp Tyr Gly 
210 215 220 

Cys Leu Gin Asn His Arg Gin Gly Gly Arg Ser Gin Gly Pro Val Glu 
225 230 235 240 

Arg Asp Leu Ala Gin His His Lys Lys Cys His Cys Gin Leu Cys Asp 

245 250 255 

Gly Asp Leu Arg His His Gin Gly Glu Val Ala Gly Val Ser Pro Val 

260 265 270 

Tyr Gin Leu Pro Leu Ser Leu Cys Leu Cys Leu Trp Ser Trp Leu Leu 
275 280 ~ 285 

Cys His Ser Gly Gly Leu Pro Gly Gly Cys Gly Lys Asp Pro He His 
290 295 300 

Glu Arg Ser Pro Arg Gin Val Pro Gin Pro Ser Ala Leu Tyr Ala Glu 
305 310 315 320 

Asp Gly Gly Ser Gly Gly Thr His Gly Leu Leu Gin Arg He Cys Ala 

325 330 335 

Leu Leu Ser Ala Ser Gly Ser Leu Glu Arg Asp Asp Val Cys Asn He 

340 345 350 

Ala Thr Glu Glu Gly Leu Ash Glu Ser Pro Gly Thr Ala Gly He Ser 
355 360 365 
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Val Leu Asn Lys Ala Ser Arg Leu Pro Gly Thr Glu Gin Ser Val Ser 
370 375 

Ala Trp Asp Thr Gly Pro His Val Arg Thr Val His Ala 
385 390 395 
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What is claimed: 

1. Isolated or recombinant nucleic acid which encodes a 
mammalian uncoupling protein 3. ; 

5 2. The nucleic acid of Claim 1 wherein the uncoupling 
protein 3 is human. 

3 . The nucleic acid of Claim 1 selected from the group 
consisting of: SEQ ID NO: 1, SEQ ID NO: 2 ar^d SEQ ID 
NO: 7. 

10 4 . The nucleic acid of Claim 1 wherein said nucleic acid 

hybridizes under stringent conditions with DNA 
selected from the group consisting of: SEQ ID NO: 1, 
the complement of SEQ ID NO : 1 # SEQ ID NO : 2 the 
complement of SEQ ID NO: 2, SEQ ID NO: 7 and the 

15 complement of SEQ ID NO: 7. 

5. The nucleic acid of Claim 1 wherein the nucleic acid 

encodes an amino acid sequence selected from the group 
consisting of: SEQ ID NO: 3, SEQ ID NO : 4, and SEQ ID 
NO. 8. 

20 6. A recombinant nucleic acid construct comprising the 

nucleic acid of Claim 1. 

7. The recombinant nucleic acid construct of Claim 6 

wherein the nucleic acid is selected from the group 
consisting of: SEQ ID NO: 1, SEQ ID NO: 2 and SEQ ID 
25 NO-. -7. 



The recombinant nucleic acid construct of Claim 6 
wherein the nucleic acid encodes the amino acid 
sequence selected from the group consisting of: 'SEQ ID 
NO: 3, SEQ ID NO 4, and SEQ ID NO: 8. 

The recombinant nucleic acid construct of Claim 6 
wherein the nucleic acid is operably linked -to an 

expression control sequence. 



- 1 



A host cell comprising the nucleic acid of Claim 1. 

The host cell of Claim 10 wherein the nucleic acid is 
operably linked to an expression control sequence, 
whereby mammalian uncoupling protein 3 is expressed 
when the host cell is maintained under conditions 
suitable for expression. 

A method for producing a mammalian uncoupling protein 
3 comprising: 

a) introducing into a host cell a nucleic acid 
construct comprising a nucleic acid which encodes 
a mammalian uncoupling protein 3; and 

b) maintaining the host cells produced in step a) 
under conditions whereby the nucleic acid is 
expressed and the mammalian uncopling protein 3 
is produced. 

An antibody or functional portion thereof which binds 
mammalian uncoupling protein 3 . 

A method of detecting mammalian uncoupling protein 3 
in a sample comprising: 

a) . contacting a sample with an antibody which binds 
•• uncoupling protein 3 , under conditions suitable 
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for specific binding of said antibody to the 
mammalian uncoupling protein 3; and 

b) detecting an antibody- mammalian uncoupling 
protein 3 complex, 
5 wherein if the antibody-mammalian uncoupling protein 

complex is detected, mammalian uncoupling protein 3 is 

present in the sample . 

15. A method of identifying an agent which alters 
uncoupling protein 3 activity comprising the steps of: 

10 a) introducing into a host cell a nucleic acid 

construct comprising a nucleic acid which encodes 
a mammalian uncoupling protein 3; 

b) maintaining the host cells produced in step a) 
under conditions appropriate for expression of 

15 the nucleic acid; 

c) contacting the cells of b) with the agent; and 

d) detecting mitochondrial electrical potential of 
the cells of c) in the presence of the agent, 

wherein a change in mitochondrial electrical potential 
2 0 in the presence of the agent indicates that the agent 

alters uncoupling protein 3 activity. 

16. The method of Claim 15 wherein the mitochondrial 
electrical potential is detected using fluorescence. 

17. A method of identifying an agent which is an activator 

2 5 of uncoupling protien 3 activity comprising the steps 

of: 

a) introducing into a host cell a nucleic acid 

construct comprising a nucleic acid which encodes 
a mammalian uncoupling protein 3; 

i 

3 0 b) maintaining the host cells produced in step a) 

under conditions appropriate for expression of 
the nucleic acid; 



trie cexxb ui ^ / j." ^ai^- ^^www-*ww 

wherein a reduction in mitochondrial electrical 
5 potential in the presence of the agent indicates that 

the agent is an activator uncoupling protein 3 
activity. 

t 

18. The method of Claim 17 wherein the mitochondrial 

'electrical potential is detected using fluorescence. 

10 19. A method of identifying an agent which is an inhibitor 

of uncoupling protein 3 activity comprising It he steps 

of: 

a) introducing into a host cell a nucleic acid 
construct comprising a nucleic acid which encodes 

15 a mammalian uncoupling protein 3; 

b) maintaining the host cells produced in step a) 
under conditions appropriate for expression of 
the nucleic acid; 

c) contacting the cells of b) with the agent; and 
20 .d) detecting mitochondrial electrical potential of 

the cells in the presence of the agent; 
wherein an increase in mitochondrial electrical 
potential in the presence of the agent indicates that 
the agent is an inhibitor uncoupling protein 3 
25 activity. 

20. The method of Claim 19 wherein the mitochondrial 
electrical potential is detected using fluorescence. 

21. A method of inhibiting protein catabolism in a mammal 
comprising administering to the mammal an effective 

30 amount of <m inhibitor of -uncoupling protein 3. 
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22. A method of enhancing protein catabolism in a mammal 
comprising adminstering to the mammal an effective 
amount of an enhancer of uncoupling protein 3. 

23. A method of inhibiting muscle wasting in a mammal 
5 comprising adminstering to the mammal an effective 

amount of an inhibitor of uncoupling protein 3 . 

24. Use of an inhibitor of uncoupling protein 3 in a 
method of inhibiting protein catabolism in a mammal, 
wherein the method comprises administering to the 

10 mammal an effective amount of an inhibitor of 

uncoupling protein 3. 

25. Use of an enhancer of uncoupling protein 3 in a method 
of enhancing protein catabolism in a mammal, wherein 
the method comprises administering to the mammal an 

15 effective amount of an enhancer of uncoupling protein 

3 . 



26. Use of an inhibitor of uncoupling protein 3 in a 

method of inhibiting protein catabolism in a mammal, 
wherein the method comprises administering to the 
20 mammal an effective amount of an inhibitor of 

uncoupling protein 3. 
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